IMDb

IMDb

The IMDb database: moderately large, real database of movies.

(BibTeX)

Versions

  • IMDB_1R

  • Imdb_ijs (by Janez Kranjc)

    • add foreign key constrains by removal of violating samples
  • Imdb_small (by Jan Motl)

Dataset details

Associated task:
Regression
Domain:
Entertainment
Data types:
Size:
54 MB
Count of tables:
3
Count of rows:
996,505
Count of columns:
10
Missing values:
No
Compound keys:
No
Loops:
No
Type:
Real
Instance count:
986,583
Target table:
HasRated
Target column:
rating
Target ID:
user_id, item_id
Target timestamp:
?

How to download the dataset

The datasets are publicly available directly from MySQL database.

  1. Open your favourite MySQL client (for example MySQL Workbench)
  2. Use following credentials:
    • hostname: relational.fit.cvut.cz
    • port: 3306
    • username: guest
    • password: relational
  3. Export "IMDB_1R" database (or other version of the dataset, if available) in your favourite format (e.g. CSV or SQL dump).