Generalized Linear Models in Python

Extend your regression toolbox with the logistic and Poisson models and learn to train, understand, and validate them, as well as to make predictions.
5 Hours | 16 Videos | 59 Exercises | 5,561 Learners | 4950 XP


Course Description

Imagine being able to handle data where the response variable is binary, count, or approximately normal, all under a single framework. Well, you don't have to imagine. Enter the Generalized Linear Models in Python course! In this course you will extend your regression toolbox with the logistic and Poisson models by learning how to fit and interpret them, assess their performance, and finally use them to make predictions on new data. You will practice using data from real-world studies such as the largest population poisoning in world history, the nesting of horseshoe crabs, and counts of bike crossings on the bridges in New York City.
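
As a rough illustration of the "single framework" idea, the sketch below pairs each response type named above with a link function and maps a linear predictor back to the mean scale. The pairings (identity for approximately normal, logit for binary, log for count) are the conventional choices and are an assumption here, since the description does not spell them out; the helper names `LINKS` and `predict_mean` are hypothetical.

```python
import math

# Hypothetical helpers for illustration only.
def identity(mu):
    return mu

def logit(p):
    # Log-odds of a probability p in (0, 1).
    return math.log(p / (1.0 - p))

def inv_logit(eta):
    # Inverse of the logit: maps any real number to (0, 1).
    return 1.0 / (1.0 + math.exp(-eta))

# Assumed pairing of response type -> (link, inverse link).
LINKS = {
    "normal": (identity, identity),
    "binary": (logit, inv_logit),
    "count": (math.log, math.exp),
}

def predict_mean(response_type, intercept, slope, x):
    """Map the linear predictor intercept + slope * x back to the mean scale."""
    _, inverse_link = LINKS[response_type]
    eta = intercept + slope * x
    return inverse_link(eta)

# The same linear predictor (eta = 1.0) gives a different mean per response type.
for kind in LINKS:
    print(kind, round(predict_mean(kind, 0.0, 0.5, 2.0), 4))
```

The point of the design is that only the link changes between model types; the linear predictor is built the same way in all three cases.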

  1. Introduction to GLMs

    Review linear models and learn how GLMs extend the linear model to different types of response variables. You will also learn the building blocks of GLMs and the technical process of fitting a GLM in Python.
  2. Modeling Binary Data

    This chapter focuses on logistic regression. You'll learn about the structure of binary data, the logit link function, and model fitting, as well as how to interpret model coefficients, perform model inference, and assess model performance.
  3. Modeling Count Data

    Here you'll learn about Poisson regression, including a discussion of count data, the Poisson distribution, and the interpretation of the model fit. You'll also learn how to overcome problems with overdispersion. Finally, you'll get hands-on experience with the process of model visualization.
  4. Multivariable Logistic Regression

    In this final chapter you'll learn how to increase the complexity of your model by adding more than one explanatory variable. You'll work through the problem of multicollinearity and practice handling categorical variables and interaction terms in your model.
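
The fitting, interpretation, and overdispersion topics in the outline above can be sketched for a one-predictor Poisson model using only the standard library. This is a minimal sketch under stated assumptions, not the course's own method: the data are made up for illustration, the function names `fit_poisson` and `pearson_dispersion` are hypothetical, and the course itself may well rely on a dedicated statistics library instead of hand-written Newton steps.

```python
import math

def fit_poisson(x, y, iterations=25):
    """Fit log(mu) = b0 + b1 * x by Newton-Raphson (hypothetical helper)."""
    # Start from the intercept-only fit so the first step is well behaved.
    b0, b1 = math.log(sum(y) / len(y)), 0.0
    for _ in range(iterations):
        mu = [math.exp(b0 + b1 * xi) for xi in x]
        # Score vector: X^T (y - mu)
        u0 = sum(yi - mi for yi, mi in zip(y, mu))
        u1 = sum((yi - mi) * xi for xi, yi, mi in zip(x, y, mu))
        # Fisher information: X^T diag(mu) X
        i00 = sum(mu)
        i01 = sum(mi * xi for xi, mi in zip(x, mu))
        i11 = sum(mi * xi * xi for xi, mi in zip(x, mu))
        det = i00 * i11 - i01 * i01
        # Solve the 2x2 system (information) * step = score.
        step0 = (i11 * u0 - i01 * u1) / det
        step1 = (i00 * u1 - i01 * u0) / det
        b0, b1 = b0 + step0, b1 + step1
        if abs(step0) + abs(step1) < 1e-12:
            break
    return b0, b1

def pearson_dispersion(x, y, b0, b1):
    """Pearson chi-square divided by residual degrees of freedom (n - 2)."""
    mu = [math.exp(b0 + b1 * xi) for xi in x]
    chi2 = sum((yi - mi) ** 2 / mi for yi, mi in zip(y, mu))
    return chi2 / (len(y) - 2)

# Made-up counts for illustration only (not one of the course datasets).
x = [0, 1, 2, 3, 4, 5]
y = [2, 1, 4, 3, 6, 8]

b0, b1 = fit_poisson(x, y)
print("intercept:", round(b0, 3), "slope:", round(b1, 3))
# exp(slope) is the multiplicative change in the expected count per unit of x.
print("rate ratio per unit x:", round(math.exp(b1), 3))
# A ratio well above 1 would hint at overdispersion relative to the Poisson model.
print("dispersion estimate:", round(pearson_dispersion(x, y, b0, b1), 3))
```

The same loop structure carries over to logistic regression by swapping the inverse link and the variance weights, which is why the two models are usually taught together.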
Datasets

Well switch due to arsenic poisoning
Nesting of the female horseshoe crab
Credit default
Level of salary and years of work experience
Medical costs per person given age and BMI
Bike crossings in New York City

Chester Ismay, Adrián Soto

Ita Cirovic Donev

Data Science consultant
Ita is a Data Science consultant. She spends her time finding stories in data and developing predictive models for credit risk using machine learning methods. With over 15 years of experience, she has worked on diverse problems with many interestingly complex datasets, ranging from loan repayment behavior to a person's spending behavior. Her free time is usually spent in bookstores or reading books.
