Skip to content

Video Games Sales Data

This dataset contains records of popular video games in North America, Japan, Europe and other parts of the world. Every video game in this dataset has at least 100k global sales.

import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

Data Dictionary

ColumnExplanation
RankRanking of overall sales
NameName of the game
PlatformPlatform of the games release (i.e. PC,PS4, etc.)
YearYear the game was released in
GenreGenre of the game
PublisherPublisher of the game
NA_SalesNumber of sales in North America (in millions)
EU_SalesNumber of sales in Europe (in millions)
JP_SalesNumber of sales in Japan (in millions)
Other_SalesNumber of sales in other parts of the world (in millions)
Global_SalesNumber of total sales (in millions)

Source of dataset.

games = pd.read_csv("vgsales.csv", index_col=0)
games
games.info()
games.isna().sum()
games.drop_duplicates(inplace =True)
games.shape
games.describe()

I Which of the three seventh generation consoles (Xbox 360, Playstation 3, and Nintendo Wii) had the highest total sales globally?

games_top_platform = games.groupby('Platform')['Global_Sales'].sum()
games_top_platform = games_top_platform.sort_values(ascending=False)
games_top_platform.head()

II Visualiqsation of the average sales for games in the most popular three genres & difference between NA, EU, and global sales.

games['Genre'].unique()
games_Global = games.groupby('Genre')['Global_Sales'].sum().sort_values(ascending=False).nlargest(3)
games_Global