Interactive Course

Inference for Categorical Data

In this course you'll learn how to leverage statistical techniques for working with categorical data.

  • 4 hours
  • 14 Videos
  • 53 Exercises
  • 1,360 Participants
  • 4,000 XP

Loved by learners at thousands of top companies:

rei-grey.svg
deloitte-grey.svg
uber-grey.svg
mls-grey.svg
ikea-grey.svg
mercedes-grey.svg

Course Description

Categorical data is all around us. It's in the latest opinion polling numbers, in the data that lead to new breakthroughs in genomics, and in the troves of data that internet companies collect to sell products to you. In this course you'll learn techniques for parsing the signal from the noise; tools for identifying when structure in this data represents interesting phenomena and when it is just random noise.

  1. Proportions: testing and power

    This chapter dives deeper into performing hypothesis tests and creating confidence intervals for a single parameter. Then, you'll learn how to perform inference on a difference between two proportions. Finally, this chapter wraps up with an exploration of what happens when you know the null hypothesis is true.

  2. Comparing many parameters: goodness of fit

    The course wraps up with two case studies using election data. Here, you'll learn how to use a Chi-squared test to check goodness-of-fit. You'll study election results from Iran and Iowa and test if Benford's law applies to these datasets.

  1. 1

    Inference for a single parameter

    Free

    In this chapter you will learn how to perform statistical inference on a single parameter that describes categorical data. This includes both resampling based methods and approximation based methods for a single proportion.

  2. Proportions: testing and power

    This chapter dives deeper into performing hypothesis tests and creating confidence intervals for a single parameter. Then, you'll learn how to perform inference on a difference between two proportions. Finally, this chapter wraps up with an exploration of what happens when you know the null hypothesis is true.

  3. Comparing many parameters: independence

    This part of the course will teach you how to use both resampling methods and classical methods to test for the indepence of two categorical variables. This chapter covers how to perform a Chi-squared test.

  4. Comparing many parameters: goodness of fit

    The course wraps up with two case studies using election data. Here, you'll learn how to use a Chi-squared test to check goodness-of-fit. You'll study election results from Iran and Iowa and test if Benford's law applies to these datasets.

What do other learners have to say?

Devon

“I've used other sites, but DataCamp's been the one that I've stuck with.”

Devon Edwards Joseph

Lloyd's Banking Group

Louis

“DataCamp is the top resource I recommend for learning data science.”

Louis Maiden

Harvard Business School

Ronbowers

“DataCamp is by far my favorite website to learn from.”

Ronald Bowers

Decision Science Analytics @ USAA

Andrew Bray
Andrew Bray

Assistant Professor of Statistics at Reed College

See More
Collaborators
  • Nick Solomon

    Nick Solomon

  • Benjamin  Feder

    Benjamin Feder

  • Jonathan Ng

    Jonathan Ng

Icon Icon Icon professional info