Stats

Stats

An anonymized dump of all user-contributed content on the Stats Stack Exchange network.

Original source: archive.org

Versions

  • Stats (by Jan Motl)

Dataset details

Associated task:
Regression
Domain:
Education
Data types:
Size:
658.4 MB
Count of tables:
8
Count of rows:
1,027,838
Count of columns:
71
Missing values:
Yes
Compound keys:
No
Loops:
Yes
Type:
Real
Instance count:
41,793
Target table:
users
Target column:
Reputation
Target ID:
Id
Target timestamp:
LastAccessDate

How to download the dataset

The datasets are publicly available directly from MariaDB database.

  1. Open your favourite MariaDB client (MySQL Workbench works, but see FAQ)
  2. Use following credentials:
    • hostname: relational.fit.cvut.cz
    • port: 3306
    • username: guest
    • password: relational
  3. Export "stats" database (or other version of the dataset, if available) in your favourite format (e.g. CSV or SQL dump).