Internet: A Global Phenomenon
This dataset contains information on internet access around the world.
The workspace is set up with two CSV files containing information on global internet access for years ranging from 1990 to 2020.
internet_users.csv
users
- The number of people who have used the internet in the last three monthsshare
- The share of the entity's population who have used the internet in the last three months
adoption.csv
fixed_telephone_subs
- The number of people who have a telephone landline connectionfixed_telephone_subs_share
- The share of the entity's population who have a telephone landline connectionfixed_broadband_subs
- The number of people who have a broadband internet landline connectionfixed_broadband_subs_share
- The share of the entity's population who have a broadband internet landline connectionmobile_cell_subs
- The number of people who have a mobile subscriptionmobile_cell_subs_share
- The share of the entity's population who have a mobile subscription
Both data files are indexed on the following 3 attributes:
entity
- The name of the country, region, or group.code
- Unique id for the country (null for other entities).year
- Year from 1990 to 2020.
Check out the guiding questions or the scenario described below to get started with this dataset! Feel free to make this workspace yours by adding and removing cells, or editing any of the existing cells.
Source: Our World In Data
🔍 Scenario: Identify emerging markets for a global internet provider
This scenario helps you develop an end-to-end project for your portfolio.
Background: You work for a global internet provider on a mission to provide affordable Internet access to everybody around the world using satellites. You are tasked with identifying which markets (regions or countries) are most worthwhile to focus efforts on.
Objective: Construct a top 5 list of countries where there is a big opportunity to roll out our services. Try to consider the amount of people not having access to (good) wired or mobile internet and their spending power.
You can query the pre-loaded CSV files using SQL directly. Here’s a sample query:
Group 1: Internet Adoption Trends
How has global internet usage (population share) evolved over the years from 1990 to 2020? What are the top 5 countries with the highest internet use (by population share)? What are the top 5 countries with the highest internet use for some large regions? Are there countries with declining internet adoption rates in recent years? How has the percentage of the population with internet access changed with education levels? Further Analysis: Combine these insights to identify historical trends in internet adoption. Investigate if regions or countries with declining adoption rates share common characteristics or face similar challenges. Analyze the relationship between education levels and internet access to understand the impact of education on digital inclusion.
Group 1 Visualizations: Internet Adoption Trends
Line Chart: Use a line chart to visualize the trend in global internet usage (population share) over the years (1990-2020). Bar Chart: Create a bar chart to display the top 5 countries with the highest internet use in 2020. Regional Heatmap: Generate a heatmap to illustrate internet adoption rates in large regions, making it easy to spot patterns. Area Chart: Use an area chart to visualize the changing population share with internet access by education level over time.
Group 2 : Internet Access Technologies 6. How does the share of the population with fixed broadband connections compare to those with mobile subscriptions?
Are there countries where fixed broadband subscriptions exceed mobile cell subscriptions? How has fixed broadband subscriptions evolved alongside internet usage (population share)? Further Analysis: Examine the coexistence of fixed and mobile internet access in countries. Identify regions where fixed broadband remains the primary mode of internet access. Explore whether countries with more fixed broadband subscriptions tend to have higher internet usage.
Group 2 Visualizations: Internet Access Technologies 5. Stacked Bar Chart: Display the share of the population with fixed broadband connections and mobile subscriptions in a stacked bar chart for selected countries.
Scatter Plot: Create a scatter plot to compare fixed broadband subscriptions and internet usage (population share) for different countries in 2020. Line Chart: Plot the evolution of fixed broadband subscriptions alongside internet usage (population share) over the years.
Group 3: Socioeconomic Factors and Internet Access 9. Is there a correlation between GDP per capita and internet usage (population share)?
How does population density relate to internet usage (population share)? Is there a correlation between government policies (e.g., censorship) and internet adoption rates? Further Analysis: Combine these insights to assess the impact of socioeconomic factors and government policies on internet access. Investigate if countries with higher GDP per capita tend to have greater internet adoption. Explore whether government policies affect internet adoption rates, especially in densely populated regions.
Group 3 Visualizations: Socioeconomic Factors and Internet Access 8. Scatter Plot Matrix: Use a scatter plot matrix to explore correlations between GDP per capita, population density, and internet usage (population share).
Choropleth Map: Create a map with color-coding to visualize government policies and their impact on internet adoption rates.
Group 4: Special Cases and Universal Access 12. Which countries exhibit the greatest digital divide within their population?
Are there regions where mobile cell subscriptions exceed the total population? Which countries have achieved universal internet access (100% population share)? Further Analysis: Examine unique cases of digital divides within countries or regions. Investigate regions where mobile penetration is exceptionally high and understand the reasons behind it. Identify countries that have successfully achieved universal internet access and analyze their strategies for achieving this milestone.
Group 4 Visualizations: Special Cases and Universal Access 10. Box Plot: Use a box plot to identify countries with significant digital divides within their population.
Bubble Chart: Visualize regions where mobile cell subscriptions exceed the total population using a bubble chart. Highlighted Map: Create a map highlighting countries that have achieved universal internet access (100% population share).
Group 5: Future Projections and Implications 15. What is the median internet usage (population share) across all countries in 2020?
What is the average growth rate of internet usage (population share) over the past decade? How do countries with high internet usage (population share) differ in terms of online activities, such as e-commerce or social media usage? Further Analysis: Combine these insights to project future internet adoption trends. Predict the median internet usage across countries in the coming years based on growth rates. Investigate how high internet usage countries engage in various online activities and assess the implications for businesses and digital platforms.
By grouping and further analyzing these insights, you can develop a comprehensive understanding of global internet access dynamics, including historical trends, socioeconomic influences, and future projections. This holistic view can inform policy decisions, business strategies, and initiatives aimed at bridging the digital divide and promoting equitable internet access worldwide.
Group 5 Visualizations: Future Projections and Implications Box and Whisker Plot: Use a box and whisker plot to display the median internet usage (population share) across all countries in 2020. Line Chart with Trendline: Plot the average growth rate of internet usage (population share) over the past decade for selected countries. Radar Chart: Create a radar chart to compare countries with high internet usage in terms of various online activities.
SELECT *
FROM 'internet_users.csv'
LIMIT 10
import pandas as pd
internet_users = pd.read_csv('internet_users.csv')
internet_users.head()
adoption = pd.read_csv('adoption.csv')
adoption.head()