
Course Description
This course bridges the gap between introductory and advanced courses in Python. While there are many excellent introductory Python courses available, most do not go deep enough for you to apply your Python skills to research projects. In this course, after first reviewing the basics of Python 3, we learn about tools commonly used in research settings.
Using a combination of a guided introduction and more independent in-depth exploration, you will get to practice your new Python skills with various case studies chosen for their scientific breadth and their coverage of different Python features. This course consists of the homework for each week.
- 1. Homework 1 (Free): Exercises for Homework (Week 1). In this homework, we will use objects, functions, and randomness to find the length of documents, approximate pi, and smooth out random noise.
- 2. Homework 2 (Free): Exercises for Homework (Week 2). Tic-tac-toe (or noughts and crosses) is a simple strategy game in which two players take turns placing a mark on a 3x3 board, attempting to make a row, column, or diagonal of three with their mark. In this homework, we will use the tools we've covered in the past two weeks to create a tic-tac-toe simulator and evaluate basic winning strategies.
- 3. Case Study 1 - Caesar Cipher (Free): A cipher is a secret code for a language. In this case study, we will explore a cipher that is reported by contemporary Greek historians to have been used by Julius Caesar to send secret messages to generals during times of war.
- 4. Case Study 2 - Translations of Hamlet (Free): In this case study, we will find and plot the distribution of word frequencies for each translation of Hamlet. Perhaps the distribution of word frequencies of Hamlet depends on the translation. Let's find out!
- 5. Case Study 3 - Practice with Classification (Free): In this case study, we will analyze a dataset consisting of an assortment of wines classified into "high quality" and "low quality", and will use k-Nearest Neighbors to test whether other information about the wine helps us correctly predict whether a new wine will be of high quality.
- 6. Case Study 4 - Visualizing Whisky Classification (Free): In this case study, we have prepared step-by-step instructions for you on how to prepare plots in Bokeh, a library designed for simple and interactive plotting. We will demonstrate Bokeh by continuing the analysis of Scotch whiskies.
- 7. Case Study 5 - Bird Migration (Free): In this case study, we will continue taking a look at patterns of flight for each of the three birds in our dataset.
- 8. Case Study 6 - Social Network Analysis (Free): Homophily is a network characteristic. Homophily occurs when nodes that share an edge share a characteristic more often than nodes that do not share an edge. In this case study, we will investigate homophily of several characteristics of individuals connected in social networks in rural India.
- 9. Case Study 7 - Movie Analysis, Part 1 - Data Preparation (Free): The movie dataset on which this case study is based is a database of 5000 movies catalogued by The Movie Database (TMDb). The information available about each movie includes its budget, revenue, rating, actors and actresses, and more. In this case study, we will use this dataset to determine whether any information about a movie can predict the total revenue of a movie. We will also attempt to predict whether a movie's revenue will exceed its budget. In Part 1, we will inspect, clean, and transform the data.
- 10. Case Study 7 - Movie Analysis, Part 2 - Modeling (Free): The movie dataset on which this case study is based is a database of 5000 movies catalogued by The Movie Database (TMDb). The information available about each movie includes its budget, revenue, rating, actors and actresses, and more. In this case study, we will use this dataset to determine whether any information about a movie can predict the total revenue of a movie. We will also attempt to predict whether a movie's revenue will exceed its budget. In Part 2, we will use the dataset prepared in Part 1 for an applied analysis.
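The definition of homophily given for Case Study 6 can be made concrete with a small sketch. This is not the course's own solution: the function name `same_trait_rates`, the toy edge list, and the `trait` dictionary are all hypothetical, and the code simply compares how often linked pairs share a characteristic with how often unlinked pairs do.

```python
from itertools import combinations


def same_trait_rates(edges, trait):
    """Return (share of linked pairs with equal trait,
               share of unlinked pairs with equal trait).

    Assumes an undirected graph with at least one linked pair
    and at least one unlinked pair (otherwise a division by zero occurs).
    """
    edge_set = {frozenset(e) for e in edges}
    linked_same = linked_total = 0
    unlinked_same = unlinked_total = 0
    # Visit every unordered pair of nodes exactly once.
    for u, v in combinations(sorted(trait), 2):
        same = trait[u] == trait[v]
        if frozenset((u, v)) in edge_set:
            linked_total += 1
            linked_same += same
        else:
            unlinked_total += 1
            unlinked_same += same
    return linked_same / linked_total, unlinked_same / unlinked_total


# Hypothetical toy network: five people, one categorical characteristic.
edges = [("a", "b"), ("b", "c"), ("c", "d"), ("d", "e")]
trait = {"a": "x", "b": "x", "c": "x", "d": "y", "e": "y"}

linked, unlinked = same_trait_rates(edges, trait)
print(f"linked pairs sharing the trait:   {linked:.2f}")
print(f"unlinked pairs sharing the trait: {unlinked:.2f}")
```

In this toy network, three of the four linked pairs share the trait, while only one of the six unlinked pairs does, so the characteristic shows homophily under the definition above. A real analysis would also need to ask whether such a gap could arise by chance.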
Datasets
- Adj allvillagerelationships vilno 1
- Adj allvillagerelationships vilno 2
- Bird tracking
- Gettysburg
- Individual characteristics
- Key vilno 1
- Key vilno 2
- Merged movie data
- Merged movie data small
- Regions
- Whiskies
Collaborators

Patrick Staples
Patrick Staples is a biostatistics post-doctoral fellow at Harvard University. He likes to study epidemic processes in networks and develops methods to determine clinically relevant behavior from smartphone data.
Join over 18 million learners and start Using Python for Research today!