Premium project

Exploring the History of Lego

Use a variety of data manipulation techniques to explore different aspects of Lego's history!

Start Project
7 Tasks1,500 XP42,018 Learners

Loved by learners at thousands of companies

Project Description

The [Rebrickable]( database includes data on every LEGO set that has ever been sold; the names of the sets, what bricks they contain, what color the bricks are, etc. It might be small bricks, but this is big data! In this project, you will get to explore the Rebrickable database and answer a series of questions related to the history of Lego!

Project Tasks

  1. 1
  2. 2
    Reading Data
  3. 3
    Exploring Colors
  4. 4
    Transparent Colors in Lego Sets
  5. 5
    Explore Lego Sets
  6. 6
    Lego Themes Over Years
  7. 7
    Wrapping It All Up!


Python Python


Data ManipulationData VisualizationImporting & Cleaning Data
Ramnath Vaidyanathan Headshot

Ramnath Vaidyanathan

VP of Product Research at DataCamp

Ramnath Vaidyanathan is the VP of Product Research at DataCamp, where he drives product innovation and data-driven development. He has 10+ years experience doing statistical modeling, machine learning, optimization, retail analytics, and interactive visualizations. He brings a unique perspective to product development, having worked in diverse industries like management consulting, academia, and enterprise softwares. Prior to joining DataCamp, he worked as a data scientist at Alteryx, leading the roadmap for interactive visualizations and dashboards for predictive analytics. Prior to Alteryx, he was an Assistant Professor of Operations Management in the Desautels Faculty of Management at McGill University. His research primarily focused on the application of predictive analytics and optimization methodologies to improve operational decisions in retailing. He got his Ph.D. in Operations Management from the Wharton School.
See More

What do other learners have to say?

I've used other sites—Coursera, Udacity, things like that—but DataCamp's been the one that I've stuck with.

Devon Edwards Joseph
Lloyds Banking Group

DataCamp is the top resource I recommend for learning data science.

Louis Maiden
Harvard Business School

DataCamp is by far my favorite website to learn from.

Ronald Bowers
Decision Science Analytics, USAA