this is the nav!
Exploring the History of Lego
• AI Chat
• Code
• Report

Everyone loves Lego (unless you ever stepped on one). Did you know by the way that "Lego" was derived from the Danish phrase leg godt, which means "play well"? Unless you speak Danish, probably not.

In this project, we will analyze a fascinating dataset on every single Lego block that has ever been built!

`.mfe-app-workspace-11z5vno{font-family:JetBrainsMonoNL,Menlo,Monaco,'Courier New',monospace;font-size:13px;line-height:20px;}`# Nothing to do here``
Hidden output

A comprehensive database of lego blocks is provided by Rebrickable. The data is available as csv files and the schema is shown below.

Let us start by reading in the colors data to get a sense of the diversity of Lego sets!

``````import pandas as pd
``````

### 3. Exploring Colors

Now that we have read the `colors` data, we can start exploring it! Let us start by understanding the number of colors available.

``````num_colors = colors.shape[0]
num_colors
``````

### 4. Transparent Colors in Lego Sets

The `colors` data has a column named `is_trans` that indicates whether a color is transparent or not. It would be interesting to explore the distribution of transparent vs. non-transparent colors.

``````colors_summary = colors.drop_duplicates(subset = 'name').groupby('is_trans').agg('count')
print(colors_summary)``````

### 5. Explore Lego Sets

Another interesting dataset available in this database is the `sets` data. It contains a comprehensive list of sets over the years and the number of parts that each of these sets contained.

Let us use this data to explore how the average number of parts in Lego sets has varied over the years.

``````%matplotlib inline
parts_by_year = sets.groupby('year')['num_parts'].mean()
parts_by_year.plot()

``````

### 6. Lego Themes Over Years

Lego blocks ship under multiple themes. Let us try to get a sense of how the number of themes shipped has varied over the years.

``````themes_by_year = sets.groupby('year')[['theme_id']].nunique()
Lego blocks offer an unlimited amount of fun across ages. We explored some interesting trends around colors, parts, and themes. Before we wrap up, let's take a closer look at the `themes_by_year` DataFrame you created in the previous step.
``````# Get the number of unique themes released in 1999