Loved by learners at thousands of companies
As statistics and machine learning methods become more pervasive, policymakers have also increased their use of these techniques when deciding how to allocate public resources. In this project, you will analyze economic and social development indicators for 22 provinces of Argentina to help plan an education program. This project assumes you can manipulate data frames using `dplyr`, make plots using `ggplot2`, and understand principal component analysis (PCA) and k-means clustering. The data for this project was published by INDEC and includes indicators such as poverty, population, and GDP for each province.
- 1Provinces of Argentina
- 2Most populous, richest provinces
- 3A matrix for PCA
- 4Reducing dimensions
- 5PCA: Variables & Components
- 6Plotting the components
- 7Cluster using K means
- 8Components with colors
- 9Buenos Aires, in a league of its own
- 10The rich provinces
- 11The poor provinces
- 12Planning for public policy
Rafael La Buonora
Data Scientist at Transforma Uruguay
With a background in economics, Rafael works as a data scientist at Transforma Uruguay, assessing the impact of public policy in the welfare of the citizens of Uruguay.
What do other learners have to say?
I've used other sites—Coursera, Udacity, things like that—but DataCamp's been the one that I've stuck with.
Devon Edwards Joseph
Lloyds Banking Group
DataCamp is the top resource I recommend for learning data science.
Harvard Business School
DataCamp is by far my favorite website to learn from.
Decision Science Analytics, USAA