Premium Project

Exploring 67 years of LEGO

In this project we will explore a database of every LEGO set ever built.

  • 7 tasks
  • 28,561 participants
  • 1,500 XP

Project Description

The Rebrickable database includes data on every LEGO set that has ever been sold: the names of the sets, what bricks they contain, what color the bricks are, etc. It might be small bricks, but this is big data! In this project, you will get to explore the Rebrickable database. To do this, you need to know your way around pandas DataFrames, and it's recommended that you take a look at the courses pandas Foundations and Manipulating DataFrames with pandas.

Project Tasks

  • 1. Introduction
  • 2. Reading Data
  • 3. Exploring Colors
  • 4. Transparent Colors in Lego Sets
  • 5. Explore Lego Sets
  • 6. Lego Themes Over Years
  • 7. Wrapping It All Up!
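
The tasks above revolve around grouping and summarizing pandas DataFrames. As a rough sketch of what that kind of exploration can look like, the snippet below uses tiny made-up tables in place of the real Rebrickable files. The column names (`is_trans`, `year`, `num_parts`, `theme_id`) and all values are assumptions for illustration only, not the project's confirmed schema or solution.

```python
import pandas as pd

# Made-up stand-ins for the colors and sets tables. Column names and
# values are illustrative assumptions, not the real Rebrickable data.
colors = pd.DataFrame({
    "name": ["Black", "Blue", "Trans-Red", "Trans-Clear"],
    "is_trans": ["f", "f", "t", "t"],
})
sets = pd.DataFrame({
    "set_num": ["A-1", "B-1", "C-1", "D-1"],
    "year": [1950, 1950, 2017, 2017],
    "num_parts": [10, 20, 300, 500],
    "theme_id": [1, 1, 2, 3],
})

# How many colors are transparent versus opaque?
colors_by_transparency = colors.groupby("is_trans")["name"].count()

# Average number of parts per set in each year.
parts_by_year = sets.groupby("year")["num_parts"].mean()

# Number of distinct themes that appear in each year.
themes_by_year = sets.groupby("year")["theme_id"].nunique()

print(colors_by_transparency)
print(parts_by_year)
print(themes_by_year)
```

In the project itself the tables would presumably be loaded from files (for example with `pd.read_csv`) rather than built by hand.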
Ramnath Vaidyanathan

VP of Product Research at DataCamp

Ramnath Vaidyanathan is the VP of Product Research at DataCamp, where he drives product innovation and data-driven development. He has 10+ years of experience in statistical modeling, machine learning, optimization, retail analytics, and interactive visualizations. He brings a unique perspective to product development, having worked in diverse industries like management consulting, academia, and enterprise software. Prior to joining DataCamp, he worked as a data scientist at Alteryx, leading the roadmap for interactive visualizations and dashboards for predictive analytics. Prior to Alteryx, he was an Assistant Professor of Operations Management in the Desautels Faculty of Management at McGill University. His research primarily focused on the application of predictive analytics and optimization methodologies to improve operational decisions in retailing. He earned his Ph.D. in Operations Management from the Wharton School.


  • Python
  • Topics

    Data Manipulation, Data Visualization, Importing & Cleaning Data