Skip to content
Project: Data-Driven Product Management: Conducting a Market Analysis
  • AI Chat
  • Code
  • Report
  • You are a product manager for a fitness studio and are interested in understanding the current demand for digital fitness classes. You plan to conduct a market analysis in Python to gauge demand and identify potential areas for growth of digital products and services.

    The Data

    You are provided with a number of CSV files in the "Files/data" folder, which offer international and national-level data on Google Trends keyword searches related to fitness and related products.

    workout.csv

    ColumnDescription
    'month'Month when the data was measured.
    'workout_worldwide'Index representing the popularity of the keyword 'workout', on a scale of 0 to 100.

    three_keywords.csv

    ColumnDescription
    'month'Month when the data was measured.
    'home_workout_worldwide'Index representing the popularity of the keyword 'home workout', on a scale of 0 to 100.
    'gym_workout_worldwide'Index representing the popularity of the keyword 'gym workout', on a scale of 0 to 100.
    'home_gym_worldwide'Index representing the popularity of the keyword 'home gym', on a scale of 0 to 100.

    workout_geo.csv

    ColumnDescription
    'country'Country where the data was measured.
    'workout_2018_2023'Index representing the popularity of the keyword 'workout' during the 5 year period.

    three_keywords_geo.csv

    ColumnDescription
    'country'Country where the data was measured.
    'home_workout_2018_2023'Index representing the popularity of the keyword 'home workout' during the 5 year period.
    'gym_workout_2018_2023'Index representing the popularity of the keyword 'gym workout' during the 5 year period.
    'home_gym_2018_2023'Index representing the popularity of the keyword 'home gym' during the 5 year period.
    # Import the necessary libraries
    import pandas as pd
    import matplotlib.pyplot as plt
    # Start coding here
    1. Converting the csv files into pandas
    df3kgeo = pd.read_csv('data/three_keywords_geo.csv', sep=',')
    df3kgeo.info()
    df3kgeo.head(20)

    df3kgeo: Many countries does not have the data about how many times the terms were measured

    df3k = pd.read_csv('data/three_keywords.csv', sep=',')
    df3k.info()
    df3k.head()
    dfwgeo = pd.read_csv('data/workout_geo.csv', sep=',')
    dfwgeo.info()
    dfwgeo.head(20)