Skip to content

0. Introduction

Let me introduce myself, my name is Geraldo.

Data Analysis

# Importing stuff
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
import matplotlib.colors as mcolors

%matplotlib inline

1. Preliminaries

# Importing the data
downloaded= pd.read_csv('datasets/downloaded_clips.csv')
gamesession= pd.read_csv('datasets/gamesession.csv')
clips= pd.read_csv('datasets/clips.csv')
premium= pd.read_csv('datasets/premium_users.csv')
shared_clips= pd.read_csv('datasets/shared_clips.csv')

Checking each of the data

# Downloaded Clips Data
downloaded.head()
# Clips data
clips.head()
# Game Session data
gamesession.head()
# Premium users data
premium.head()
# Shared Clips data
shared_clips.head()

TO-DO

  1. Understand and highlight the behavioral differences between Free users and Premium users.
  2. Provide suggestions on how to encourage Free users to become Premium users.
  3. Offer suggestions on whether the company should focus on some specific games. Explain the reasons behind your suggestions, supported by relevant data.

1.1 Data Cleaning

Dropping Unnamed: 0 column from each table

downloaded = downloaded.drop(columns=['Unnamed: 0'])
gamesession = gamesession.drop(columns=['Unnamed: 0'])
clips = clips.drop(columns=['Unnamed: 0'])
premium = premium.drop(columns=['Unnamed: 0'])
shared_clips = shared_clips.drop(columns=['Unnamed: 0'])