Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.Loved by learners at thousands of companies
Course Description
This course bridges the gap between introductory and advanced courses in Python. While there are many excellent introductory Python courses available, most typically do not go deep enough for you to apply your Python skills to research projects. In this course, after first reviewing the basics of Python 3, we learn about tools commonly used in research settings.\nUsing a combination of a guided introduction and more independent in-depth exploration, you will get to practice your new Python skills with various case studies chosen for their scientific breadth and their coverage of different Python features.This consists of the homeworks for each week.
- 1
Homework 1
FreeExercises for homework (Week 1). In this homework, we will use objects, functions, and randomness to find the length of documents, approximate pi, and smooth out random noise.
- 2
Homework 2
FreeExercises for Homework (Week 2). Tic-tac-toe (or noughts and crosses) is a simple strategy game in which two players take turns placing a mark on a 3x3 board, attempting to make a row, column, or diagonal of three with their mark. In this homework, we will use the tools we've covered in the past two weeks to create a tic-tac-toe simulator and evaluate basic winning strategies.
- 3
Case Study 1 - Caesar Cipher
FreeA cipher is a secret code for a language. In this case study, we will explore a cipher that is reported by contemporary Greek historians to have been used by Julius Caesar to send secret messages to generals during times of war.
- 4
Case Study 2 - Translations of Hamlet
FreeIn this case study, we will find and plot the distribution of word frequencies for each translation of Hamlet. Perhaps the distribution of word frequencies of Hamlet depends on the translation - let's find out!
- 5
Case Study 3 - Practice with Classification
FreeIn this case study, we will analyze a dataset consisting of an assortment of wines classified into "high quality" and "low quality", and will use k-Nearest Neighbors to predict whether or not other information about the wine helps us correctly guess whether a new wine will be of high quality.
- 6
Case Study 4 - Visualizing Whisky Classification
FreeIn this case study, we have prepared step-by-step instructions for you on how to prepare plots in Bokeh, a library designed for simple and interactive plotting. We will demonstrate Bokeh by continuing the analysis of Scotch whiskies.
- 7
Case Study 5 - Bird Migration
FreeIn this case study, we will continue taking a look at patterns of flight for each of the three birds in our dataset.
- 8
Case Study 6 - Social Network Analysis
FreeHomophily is a network characteristic. Homophily occurs when nodes that share an edge share a characteristic more often than nodes that do not share an edge. In this case study, we will investigate homophily of several characteristics of individuals connected in social networks in rural India.
- 9
Case Study 7 - Movie Analysis, Part 1 - Data Preparation
FreeThe [movie dataset on which this case study is based](https://www.kaggle.com/tmdb/tmdb-movie-metadata) is a database of 5000 movies catalogued by [The Movie Database (TMDb)](https://www.themoviedb.org/?language=en). The information available about each movie is its budget, revenue, rating, actors and actresses, etc. In this case study, we will use this dataset to determine whether any information about a movie can predict the total revenue of a movie. We will also attempt to predict whether a movie's revenue will exceed its budget. In Part 1, we will inspect, clean, and transform the data.
- 10
Case Study 7 - Movie Analysis, Part 2 - Modeling
FreeThe [movie dataset on which this case study is based](https://www.kaggle.com/tmdb/tmdb-movie-metadata) is a database of 5000 movies catalogued by [The Movie Database (TMDb)](https://www.themoviedb.org/?language=en). The information available about each movie is its budget, revenue, rating, actors and actresses, etc. In this case study, we will use this dataset to determine whether any information about a movie can predict the total revenue of a movie. We will also attempt to predict whether a movie's revenue will exceed its budget. In Part 2, we will use the dataset prepared in Part 1 for an applied analysis.
Datasets
Adj allvillagerelationships vilno 1Adj allvillagerelationships vilno 2Bird trackingGettysburgIndividual characteristicsKey vilno 1Key vilno 2Merged movie dataMerged movie data smallRegionsWhiskiesCollaborators

Patrick Staples
See MorePatrick Staples is a biostatistics post-doctoral fellow at Harvard University. He likes to study epidemic processes in networks, and develops methods to determine clinically relevant behavior from smartphone data.
What do other learners have to say?
Join over 11 million learners and start Using Python for Research today!
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.