About

IMDB is the biggest movie database online with data publicly available for downloand. We downloaded data from 1913 to 2014 using some modified ruby scripts based off Hadley’s originals, and parsed them into format suitable for analysis. The output is a tidy dataset with 5944 films and the following fields:

We analyzed this dataset. This dashboard presents some of the charts we made using the ezplot R package.

Made by Cabaceo LLC.

Films & Votes

Row

Number of films over the years

Number of votes over the years

Row

Average votes per film over the years

Number of films over 4 periods

Budget & Boxoffice

Row

Distributions of budget and boxoffice over 4 periods

Total budget and boxoffice over the years

Row

Average budget and boxoffice over the years

Boxoffice/budget over the years

User Rating

Row

Distribution of votes for r1-10

Distribution of users’ average ratings

Row

Median of the average ratings over the years

Standard deviation of the average ratings over the years

MPAA, Length and Genre

Row

Distribution of MPAA film ratings

Distribution of film lengths

Row

Distribution of film genres

Top 5 genres for each period

Boxoffice vs. Other Vars

Row

Boxoffice vs. budget

Votes vs. boxoffice

Row

Users’ average rating vs. boxoffice

Boxoffice vs. MPAA films rating

Boxoffice/Budget vs. Other Vars

Row

Boxoffice/budget over the years

Users’ average rating vs. boxoffice/budget

Row

Votes vs. boxoffice/budget

Boxoffice/budget vs. MPAA films rating