Mini-Assignment 10

Directions

  1. Familiarize yourself with the codebook for the movies dataset below.
  2. Import/load the movies dataset.
  3. Suppose you want to determine whether movie budget is significantly associated with the movie rating. Construct the appropriate regression. Answer Question 1 below.
  4. Suppose you want to determine whether movie budget is significantly associated with movie rating after controlling for MPAA-designation. Construct the appropriate regression. Answer Questions 2-3 below.
  5. There is reason to believe that the relationship between budget and viewer rating may differ depending on whether the movie is a Comedy. Construct an appropriate graph that allows you to assess this theory. Answer Question 4 below.
  6. Create a subset that includes only Comedies. With this subset, construct a model that determines whether there is a relationship between budget and viewer rating. Answer Question 5 below.
  7. Create a subset that includes only non-Comedies. With this subset, construct a model that determines whether there is a relationship between budget and viewer rating. Answer Questions 6-7 below.
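
As a rough illustration of steps 3 and 4, the sketch below fits a simple regression of rating on budget, then adds dummy-coded MPAA indicators as controls. It is written in Python with NumPy, which is an assumption since the assignment does not name a language. The data are invented toy values, not the movies dataset, and the `ols` helper is hypothetical. R is used as the reference MPAA level so that the NC-17 coefficient is a direct NC-17 versus R comparison.

```python
import numpy as np

def ols(X, y):
    """Ordinary least squares: coefficients, standard errors, t-statistics."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n, k = X.shape
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    sigma2 = resid @ resid / (n - k)        # residual variance estimate
    cov = sigma2 * np.linalg.inv(X.T @ X)   # covariance of the coefficient estimates
    se = np.sqrt(np.diag(cov))
    return beta, se, beta / se

# Toy data, invented for illustration only (NOT the movies dataset).
budget = np.array([5, 12, 20, 35, 50, 8, 60, 25, 40, 15, 3, 70], dtype=float)
rating = np.array([6.1, 5.8, 6.5, 6.0, 6.9, 5.2, 7.1, 5.5, 6.4, 6.6, 4.9, 6.8])
mpaa = np.array(["R", "PG", "R", "PG-13", "R", "NC-17",
                 "PG-13", "NC-17", "PG", "R", "NC-17", "PG-13"])

ones = np.ones_like(budget)

# Step 3: rating ~ budget. Columns are [intercept, budget].
X1 = np.column_stack([ones, budget])
b1, se1, t1 = ols(X1, rating)
print("budget slope:", b1[1], "t-statistic:", t1[1])

# Step 4: rating ~ budget + mpaa, with R as the omitted reference level.
levels = ["PG", "PG-13", "NC-17"]  # assumed levels for the toy data
dummies = [(mpaa == lev).astype(float) for lev in levels]
X2 = np.column_stack([ones, budget, *dummies])
b2, se2, t2 = ols(X2, rating)
print("budget slope, controlling for mpaa:", b2[1], "t-statistic:", t2[1])
print("NC-17 vs R difference:", b2[4], "t-statistic:", t2[4])
```

The budget slope is the expected change in rating for a one-unit increase in budget. Each t-statistic would be compared against a t distribution with n - k degrees of freedom to judge significance; a statistics package reports that p-value directly.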

Questions

Question 1: Is budget significantly associated with movie rating? Interpret the term in the model that describes the relationship.

Question 2: Is budget significantly associated with movie rating when controlling for MPAA-designation?

Question 3: Are NC-17 movies rated significantly differently than R-rated movies when controlling for budget?

Question 4: Using your graph, does it visually appear that Comedy-status moderates the relationship between budget and viewer rating?

Question 5: Among comedies, is there a relationship between budget and viewer rating?

Question 6: Among non-comedies, is there a relationship between budget and viewer rating?

Question 7: Does the relationship between budget and viewer rating vary based on whether a movie is a comedy?
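
One common way to think about Questions 5 through 7 together is sketched below, again in Python with NumPy on invented toy data (an assumption, not the movies dataset or necessarily the approach the assignment intends). It fits the budget slope separately for comedies and non-comedies, then fits a single model with a budget-by-comedy interaction. The interaction coefficient equals the difference between the two subset slopes, so it is the term that directly describes how much the relationship varies by comedy status.

```python
import numpy as np

# Toy data, invented for illustration only (NOT the movies dataset).
budget = np.array([5, 10, 20, 40, 60, 80, 5, 15, 30, 45, 65, 90], dtype=float)
rating = np.array([6.5, 6.2, 6.0, 5.6, 5.5, 5.1, 5.0, 5.4, 5.9, 6.3, 6.6, 7.2])
comedy = np.array([1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0], dtype=float)

def fit(X, y):
    """Least-squares coefficients for design matrix X."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return beta

ones = np.ones_like(budget)
is_com = comedy == 1

# Separate fits on each subset: columns are [intercept, budget].
b_com = fit(np.column_stack([ones[is_com], budget[is_com]]), rating[is_com])
b_non = fit(np.column_stack([ones[~is_com], budget[~is_com]]), rating[~is_com])

# One model with an interaction: [intercept, budget, comedy, budget*comedy].
X_int = np.column_stack([ones, budget, comedy, budget * comedy])
b_int = fit(X_int, rating)

print("slope among comedies:    ", b_com[1])
print("slope among non-comedies:", b_non[1])
print("interaction coefficient: ", b_int[3])
```

Finding that one subset slope is significant and the other is not does not by itself show that the two slopes differ from each other; the standard error of the difference is needed for that comparison.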

 


CODEBOOK: Movies Data

The Internet Movie Database, http://imdb.com/, is a website devoted to collecting movie data supplied by studios and fans. It claims to be the biggest movie database on the web and is run by Amazon. More information about imdb.com can be found online, http://imdb.com/help/show_leaf?about, including information about the data collection process, http://imdb.com/help/show_leaf?infosource.

The description of the data is as follows:

  • title. Title of the movie.
  • year. Year of release.
  • budget_millions. Total budget (if known) in US dollars.
  • length. Length in minutes.
  • rating. Average IMDB user rating.
  • votes. Number of IMDB users who rated this movie.
  • r1-10. Multiplying by ten gives the percentile (to the nearest 10%) of users who rated this movie a 1 (r1) through a 10 (r10).
  • mpaa. MPAA designation.
  • Action, Animation, Comedy, Drama, Documentary, Romance, Short. Binary variables indicating whether the movie was classified as belonging to that genre.
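
The codebook above can be turned into a loading and subsetting step along the following lines. This is a minimal sketch using Python's standard library. The CSV text is an invented stand-in for the real file, the "NA" marker for an unknown budget is an assumption about how missing values are encoded, and `load_rows` is a hypothetical helper. Only the column names come from the codebook.

```python
import csv
import io

# Toy CSV text standing in for the real file; the rows are invented and the
# "NA" marker for an unknown budget is an assumption about the encoding.
toy_csv = """title,budget_millions,rating,mpaa,Comedy
Movie A,12.5,6.4,R,1
Movie B,NA,7.1,PG,0
Movie C,40,5.9,PG-13,1
Movie D,3,6.8,R,0
"""

def load_rows(handle):
    """Read rows, converting numeric fields and dropping unknown budgets."""
    rows = []
    for rec in csv.DictReader(handle):
        if rec["budget_millions"] in ("", "NA"):
            continue  # budget is recorded only "if known"
        rows.append({
            "title": rec["title"],
            "budget_millions": float(rec["budget_millions"]),
            "rating": float(rec["rating"]),
            "mpaa": rec["mpaa"],
            "Comedy": int(rec["Comedy"]),
        })
    return rows

movies = load_rows(io.StringIO(toy_csv))

# Subsets for the Comedy and non-Comedy models.
comedies = [m for m in movies if m["Comedy"] == 1]
non_comedies = [m for m in movies if m["Comedy"] == 0]
print(len(movies), len(comedies), len(non_comedies))
```

With the actual dataset, the `io.StringIO(toy_csv)` argument would be replaced by an open file handle for wherever the movies file is stored.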