In this course you will begin learning the art and science of data exploration. You'll also become familiar with some of the most widely used visualization packages in data science. You'll get the chance to explore a number of fun & interesting datasets such as the 2013 American Community Survey, pigeon racing data, data on the length of chopsticks and Marijuana prices in the US. After completing the course, you'll be able to upload your first Kaggle Script.
Bonus Chapter: Once you have finished your exercises on DataCamp it's time to start building a data science portfolio with your new skills! In this bonus chapter, you'll be given the chance to publish analyses on Kaggle Scripts that you've personalized with information from your own life.
- (Basic) Visualize a dataset on US Baby Names to learn the popularity of your name or a friend's name in the USA over the past 100+ years.
- (Intermediate) Create a polished analysis in RMarkdown that walks readers through recent changes at the college you attended or would like to attend. You will explore changes in enrollment, tuition, admissions rate, and degree types.
- (Advanced) Each of Ben's scripts end with suggestions for creating an even meatier analysis. Which ones can you use to extend your work?
Analyzing the 2013 American Community Survey
In this first chapter you will use data from the 2013 American Community Survey to figure out whether it makes sense to pursue a PhD or not. The end result of this chapter will be your own Kaggle script that you can add to your Kaggle account.
Pigeons, Chopsticks, Spanish Silver and Marijuana
Analyze pigeon racing data, discover the optimal size for your chopsticks, plot Spanish silver production in the 18th century and map the average price of marijuana in the different states of the US.