Eklipse.GG
0. Introduction
Let me introduce myself: my name is Geraldo.
Data Analysis
# Importing stuff
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
import matplotlib.colors as mcolors
%matplotlib inline
1. Preliminaries
# Importing the data
downloaded = pd.read_csv('datasets/downloaded_clips.csv')
gamesession = pd.read_csv('datasets/gamesession.csv')
clips = pd.read_csv('datasets/clips.csv')
premium = pd.read_csv('datasets/premium_users.csv')
shared_clips = pd.read_csv('datasets/shared_clips.csv')
Checking each dataset
# Downloaded Clips Data
downloaded.head()
# Clips data
clips.head()
# Game Session data
gamesession.head()
# Premium users data
premium.head()
# Shared Clips data
shared_clips.head()
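Beyond eyeballing head(), it can help to get one compact overview of all tables at once. The following is a minimal sketch, not part of the original analysis: the summarize helper is hypothetical, and the toy table with its user_id and duration columns is an assumption made purely for illustration, since the real column names are not shown here.

```python
import pandas as pd

def summarize(tables):
    """Return one row per table: row count, column count, missing cells, duplicate rows."""
    rows = []
    for name, df in tables.items():
        rows.append({
            "table": name,
            "rows": len(df),
            "columns": df.shape[1],
            "missing_cells": int(df.isna().sum().sum()),
            "duplicate_rows": int(df.duplicated().sum()),
        })
    return pd.DataFrame(rows).set_index("table")

# Toy stand-in for a real table (column names are assumptions, not the real schema)
demo = {
    "clips": pd.DataFrame({"user_id": [1, 1, 2], "duration": [30.0, 30.0, None]}),
}
summary = summarize(demo)
print(summary)
```

With the real data, the same helper could be called as summarize({"downloaded": downloaded, "clips": clips, ...}) to compare the five tables side by side.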
TO-DO
- Understand and highlight the behavioral differences between Free users and Premium users.
- Provide suggestions on how to encourage Free users to become Premium users.
- Offer suggestions on whether the company should focus on some specific games. Explain the reasons behind your suggestions, supported by relevant data.
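One possible way to approach the first item is to label each user as Free or Premium and compare an activity metric between the two groups. This is only a sketch under assumptions: the user_id and game_name columns and the toy values below are hypothetical and stand in for whatever the real tables contain.

```python
import pandas as pd

# Toy data; column names and values are assumptions for illustration only
clips_demo = pd.DataFrame({
    "user_id": [1, 1, 2, 3, 3, 3],
    "game_name": ["A", "B", "A", "A", "A", "B"],
})
premium_demo = pd.DataFrame({"user_id": [3]})

# Count clips per user
clips_per_user = (
    clips_demo.groupby("user_id").size().rename("n_clips").reset_index()
)

# A user counts as Premium if their id appears in the premium table
is_premium = clips_per_user["user_id"].isin(premium_demo["user_id"])
clips_per_user["tier"] = is_premium.map({True: "Premium", False: "Free"})

# Compare the distribution of clips per user across tiers
tier_summary = clips_per_user.groupby("tier")["n_clips"].agg(["count", "mean", "median"])
print(tier_summary)
```

The same pattern could be repeated for downloads, shares, or game sessions, assuming those tables also carry a user identifier.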
1.1 Data Cleaning
Dropping the Unnamed: 0 column from each table
downloaded = downloaded.drop(columns=['Unnamed: 0'])
gamesession = gamesession.drop(columns=['Unnamed: 0'])
clips = clips.drop(columns=['Unnamed: 0'])
premium = premium.drop(columns=['Unnamed: 0'])
shared_clips = shared_clips.drop(columns=['Unnamed: 0'])
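The Unnamed: 0 column typically appears when a CSV was written with its index and then read back without index_col. As a hedged alternative to the five separate drop calls, the sketch below shows a small helper (drop_index_column is a hypothetical name) that is safe to re-run, plus the option of avoiding the column at read time. The inline CSV text and its user_id and clip_id columns are made up for the demonstration.

```python
import io
import pandas as pd

def drop_index_column(df, name="Unnamed: 0"):
    """Drop the leftover index column; do nothing if it is already gone."""
    # errors="ignore" keeps the cell re-runnable without raising KeyError
    return df.drop(columns=[name], errors="ignore")

# Toy CSV with an unnamed first column, mimicking a saved index
csv_text = ",user_id,clip_id\n0,10,a\n1,11,b\n"

raw = pd.read_csv(io.StringIO(csv_text))
cleaned = drop_index_column(raw)
cleaned_again = drop_index_column(cleaned)  # second call is a no-op

# Alternative: treat the first column as the index while reading
via_index_col = pd.read_csv(io.StringIO(csv_text), index_col=0)

print(list(raw.columns))
print(list(cleaned.columns))
```

Applied to the real tables, the helper could be used in a loop or comprehension over the five DataFrames instead of repeating the drop call.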