Premium project

Investigating Netflix Movies and Guest Stars in The Office

Apply the foundational Python skills you learned in Introduction to Python and Intermediate Python by manipulating and visualizing movie and TV data.

10 Tasks | 1,500 XP | 56,967 Learners


Project Description

In this project, you’ll apply the skills you learned in Introduction to Python and Intermediate Python to solve a real-world data science problem. You’ll press “watch next episode” to discover whether Netflix’s movies are getting shorter over time and which guest stars appear in the most popular episode of “The Office”, using everything from lists and loops to pandas and matplotlib. You’ll also gain experience in an essential data science skill: exploratory data analysis. This skill will allow you to perform critical tasks such as manipulating raw data and drawing conclusions from the plots you create. Press play to begin!

Project Tasks

  1. Loading your friend's data into a dictionary
  2. Creating a DataFrame from a dictionary
  3. A visual inspection of our data
  4. Loading the rest of the data from a CSV
  5. Filtering for movies!
  6. Creating a scatter plot
  7. Digging deeper
  8. Marking non-feature films
  9. Plotting with color!
  10. What next?
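
To give a feel for the early tasks, here is a minimal sketch that uses only built-in dictionaries, lists, and loops. It is not the project's own code: every title, year, duration, genre label, and color below is invented for illustration, and the project itself goes on to use pandas and matplotlib for the DataFrame and plotting steps.

```python
# Hypothetical "friend's data": years and average durations as two lists
# stored in a dictionary (all values invented for illustration).
movie_dict = {
    "years": [2011, 2012, 2013],
    "durations": [103, 101, 99],
}

# Hypothetical catalog rows standing in for the CSV data.
catalog = [
    {"title": "Film A", "type": "Movie", "genre": "Dramas", "duration": 110},
    {"title": "Show B", "type": "TV Show", "genre": "Comedies", "duration": 1},
    {"title": "Film C", "type": "Movie", "genre": "Documentaries", "duration": 75},
    {"title": "Film D", "type": "Movie", "genre": "Stand-Up", "duration": 60},
]

# Filtering for movies: keep only the rows whose type is "Movie".
movies = [row for row in catalog if row["type"] == "Movie"]

# Marking non-feature films: loop over the movies and pick a color per genre.
# The genre-to-color mapping is an assumption made for this sketch.
genre_colors = {"Documentaries": "blue", "Stand-Up": "green"}
colors = []
for movie in movies:
    colors.append(genre_colors.get(movie["genre"], "black"))

# Pair each title with its color to inspect the result.
for movie, color in zip(movies, colors):
    print(movie["title"], movie["duration"], color)
```

A list of colors like this, with one entry per movie, could then be passed to a scatter plot so that non-feature films stand out from the rest.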

Technologies

Python

Topics

Data Manipulation, Data Visualization, Programming

Justin Saddlemyer

Justin is a Workspace Content Developer at DataCamp. He holds a bachelor's degree in psychology from St. Francis Xavier University, and a graduate degree in social psychology from VU Amsterdam. In 2016 Justin received a PhD in marketing from KU Leuven.

What do other learners have to say?

I've used other sites—Coursera, Udacity, things like that—but DataCamp's been the one that I've stuck with.

Devon Edwards Joseph
Lloyds Banking Group

DataCamp is the top resource I recommend for learning data science.

Louis Maiden
Harvard Business School

DataCamp is by far my favorite website to learn from.

Ronald Bowers
Decision Science Analytics, USAA