Skip to main content
HomePythonStatistical Thinking in Python (Part 1)

# Statistical Thinking in Python (Part 1)

4.6+
30 reviews
Intermediate

Build the foundation you need to think statistically and to speak the language of your data.

Start Course for Free
3 hours18 videos61 exercises
179,440 learnersStatement of Accomplishment

## Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.
Training 2 or more people?Try DataCamp For Business

## Course Description

After all of the hard work of acquiring data and getting them into a form you can work with, you ultimately want to make clear, succinct conclusions from them. This crucial last step of a data analysis pipeline hinges on the principles of statistical inference. In this course, you will start building the foundation you need to think statistically, speak the language of your data, and understand what your data is telling you. The foundations of statistical thinking took decades to build, but can be grasped much faster today with the help of computers. With the power of Python-based tools, you will rapidly get up-to-speed and begin thinking statistically by the end of this course.
For Business

### .css-1goj2uy{margin-right:8px;}Group.css-gnv7tt{font-size:20px;font-weight:700;white-space:nowrap;}.css-12nwtlk{box-sizing:border-box;margin:0;min-width:0;color:#05192D;font-size:16px;line-height:1.5;font-size:20px;font-weight:700;white-space:nowrap;}Training 2 or more people?

Get your team access to the full DataCamp library, with centralized reporting, assignments, projects and more
Try DataCamp for BusinessFor a bespoke solution book a demo.
1. 1

### Graphical Exploratory Data Analysis

Free

Before diving into sophisticated statistical inference techniques, you should first explore your data by plotting them and computing simple summary statistics. This process, called exploratory data analysis, is a crucial first step in statistical analysis of data.

Play Chapter Now
Introduction to Exploratory Data Analysis
50 xp
What is the goal of statistical inference?
50 xp
Advantages of graphical EDA
50 xp
Plotting a histogram
50 xp
Plotting a histogram of iris data
100 xp
Axis labels!
100 xp
Adjusting the number of bins in a histogram
100 xp
Plot all of your data: Bee swarm plots
50 xp
Bee swarm plot
100 xp
Interpreting a bee swarm plot
50 xp
Plot all of your data: ECDFs
50 xp
Computing the ECDF
100 xp
Plotting the ECDF
100 xp
Comparison of ECDFs
100 xp
Onward toward the whole story!
50 xp
2. 2

### Quantitative Exploratory Data Analysis

In this chapter, you will compute useful summary statistics, which serve to concisely describe salient features of a dataset with a few numbers.

3. 3

### Thinking Probabilistically-- Discrete Variables

Statistical inference rests upon probability. Because we can very rarely say anything meaningful with absolute certainty from data, we use probabilistic language to make quantitative statements about data. In this chapter, you will learn how to think probabilistically about discrete quantities: those that can only take certain values, like integers.

4. 4

### Thinking Probabilistically-- Continuous Variables

It’s time to move onto continuous variables, such as those that can take on any fractional value. Many of the principles are the same, but there are some subtleties. At the end of this final chapter, you will be speaking the probabilistic language you need to launch into the inference techniques covered in the sequel to this course.

For Business

### GroupTraining 2 or more people?

Get your team access to the full DataCamp library, with centralized reporting, assignments, projects and more

datasets

2008 election results (all states)2008 election results (swing states)Belmont StakesSpeed of light

collaborators

prerequisites

Python Toolbox
Justin Bois

Lecturer at the California Institute of Technology

Justin Bois is a Teaching Professor in the Division of Biology and Biological Engineering at the California Institute of Technology. He teaches nine different classes there, nearly all of which heavily feature Python. He is dedicated to empowering students in the biological sciences with quantitative tools, particularly data analysis skills. Beyond biologists, he is thrilled to develop courses for DataCamp, whose students are an excited bunch of burgeoning data scientists!
See More

## Don’t just take our word for it

*4.6
from 30 reviews
70%
23%
7%
0%
0%
Sort by
• Abe A.
10 days

Amazing course, one of the best on DataCamp.

• Vlad P.
8 months

The course provide fundamentals of key operations within EDA, using raw NumPy functions.

• Rachel Z.
10 months

Great introductory course in statistics!

• Vitalis A.
10 months

Great content

• Anand T.
10 months

Clear instruction and explanation of stat aplication in Python

"Amazing course, one of the best on DataCamp."

Abe A.

"The course provide fundamentals of key operations within EDA, using raw NumPy functions."

Vlad P.

"Great introductory course in statistics!"

Rachel Z.

## Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.