Stats

An anonymized dump of all user-contributed content on the Stats Stack Exchange site.

Original source: archive.org

Versions

  • Stats (by Jan Motl)

Dataset details

Associated task: Regression
Domain: Education
Data types:
Size: 658.4 MB
Count of tables: 8
Count of rows: 1,027,838
Count of columns: 71
Missing values: Yes
Compound keys: No
Loops: Yes
Type: Real
Instance count: 41,793
Target table: users
Target column: Reputation
Target ID: Id
Target timestamp: LastAccessDate

How to download the dataset

The dataset is publicly available directly from a MySQL database.

  1. Open your favourite MySQL client (for example, MySQL Workbench).
  2. Use the following credentials:
    • hostname: relational.fit.cvut.cz
    • port: 3306
    • username: guest
    • password: relational
  3. Export the "stats" database (or another version of the dataset, if available) in your favourite format (e.g. CSV or SQL dump).
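
The same steps can also be scripted instead of using a graphical client. The following is a minimal sketch, not an official export tool: it assumes the third-party PyMySQL package is installed (pip install pymysql) and that the server accepts the guest credentials listed above. The helper names export_table and main, and the one-CSV-per-table layout, are illustrative choices rather than part of the dataset's documentation.

```python
import csv

# Connection details copied from the credentials listed above.
CONFIG = {
    "host": "relational.fit.cvut.cz",
    "port": 3306,
    "user": "guest",
    "password": "relational",
    "database": "stats",
}


def export_table(conn, table, path):
    """Write every row of `table` to a CSV file at `path`; return the row count.

    `conn` is any DB-API style connection whose cursors work as context
    managers and can be iterated (PyMySQL cursors do both).
    """
    count = 0
    with conn.cursor() as cur:
        # Table names come from SHOW TABLES, not user input, so plain
        # backtick quoting is used here.
        cur.execute(f"SELECT * FROM `{table}`")
        header = [col[0] for col in cur.description]
        with open(path, "w", newline="", encoding="utf-8") as fh:
            writer = csv.writer(fh)
            writer.writerow(header)
            for row in cur:
                writer.writerow(row)
                count += 1
    return count


def main():
    """Export every table of the configured database to <table>.csv."""
    import pymysql  # third-party; assumed to be installed

    conn = pymysql.connect(**CONFIG)
    try:
        with conn.cursor() as cur:
            cur.execute("SHOW TABLES")
            tables = [row[0] for row in cur.fetchall()]
        for table in tables:
            rows = export_table(conn, table, f"{table}.csv")
            print(f"{table}: {rows} rows")
    finally:
        conn.close()
```

Calling main() needs network access and writes the CSV files to the current directory. NULL values are written as empty fields, so columns with missing values may need explicit handling when the files are read back.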