Loved by learners at thousands of companies
Imagine being able to handle data where the response variable is either binary, count, or approximately normal, all under one single framework. Well, you don't have to imagine. Enter the Generalized Linear Models in Python course! In this course you will extend your regression toolbox with the logistic and Poisson models, by learning how to fit, understand, assess model performance and finally use the model to make predictions on new data. You will practice using data from real world studies such the largest population poisoning in world's history, nesting of horseshoe crabs and counting the bike crossings on the bridges in New York City.
Introduction to GLMsFree
Review linear models and learn how GLMs are an extension of the linear model given different types of response variables. You will also learn the building blocks of GLMs and the technical process of fitting a GLM in Python.Going beyond linear regression50 xpApplying linear models50 xpLinear model, a special case of GLM100 xpHow to build a GLM?50 xpData type and distribution family100 xpLinear model and a binary response variable100 xpComparing predicted values100 xpHow to fit a GLM in Python?50 xpModel fitting step-by-step100 xpResults of the model fit using summary()100 xpExtracting parameter estimates100 xp
Modeling Binary Data
This chapter focuses on logistic regression. You'll learn about the structure of binary data, the logit link function, model fitting, as well as how to interpret model coefficients, model inference, and how to assess model performance.Binary data and logistic regression50 xpCompute odds and probabilities100 xpFit logistic regression100 xpInterpreting coefficients50 xpCoefficients in terms of odds100 xpModel formula50 xpInterpreting logistic model50 xpRate of change in probability100 xpInterpreting model inference50 xpStatistical significance100 xpComputing Wald statistic100 xpConfidence intervals100 xpComputing and describing predictions50 xpVisualize model fit using regplot()100 xpCompute predictions100 xpCompute confusion matrix100 xp
Modeling Count Data
Here you'll learn about Poisson regression, including the discussion on count data, Poisson distribution and the interpretation of the model fit. You'll also learn how to overcome problems with overdispersion. Finally, you'll get hands-on experience with the process of model visualization.Count data and Poisson distribution50 xpVisualize the response100 xpFitting a Poisson regression100 xpInterpreting model fit50 xpEstimate parameter lambda100 xpInterpret Poisson coefficients100 xpPoisson confidence intervals100 xpThe Problem of Overdispersion50 xpIs the mean equal to the variance?100 xpComputing expected number of counts100 xpChecking for overdispersion100 xpFitting negative binomial100 xpConfidence intervals for negative Binomial model100 xpPlotting a regression model50 xpPlotting data and linear model fit100 xpPlotting fitted values100 xp
Multivariable Logistic Regression
In this final chapter you'll learn how to increase the complexity of your model by adding more than one explanatory variable. You'll practice with the problem of multicollinearity, and with treating categorical and interaction terms in your model.Multivariable logistic regression50 xpFit a multivariable logistic regression100 xpThe effect of multicollinearity100 xpCompute VIF100 xpComparing models50 xpChecking model fit100 xpCompare two models100 xpDeviance and linear transformation100 xpModel formula50 xpModel matrix for continuous variables100 xpVariable transformation100 xpCoding categorical variables100 xpCategorical and interaction terms50 xpModeling with categorical variable100 xpInteraction terms100 xpCongratulations!50 xp
DatasetsWell switch due to arsenic poisoningNesting of the female horseshoe crabCredit defaultLevel of salary and years of work experienceMedical costs per person given age and BMIBike crossings in New York City
PrerequisitesIntroduction to Linear Modeling in Python
Ita Cirovic Donev
Data Science consultant
Ita is a Data Science consultant. She spends her time finding stories in data and developing predictive models for credit risk using machine learning methods. With the experience of over 15 years, she has worked on diverse problems with many interestingly complex datasets, ranging from loan repayment behavior to a person's spending behavior. Her free time is usually spent in bookstores or reading books.
What do other learners have to say?
I've used other sites—Coursera, Udacity, things like that—but DataCamp's been the one that I've stuck with.
Devon Edwards Joseph
Lloyds Banking Group
DataCamp is the top resource I recommend for learning data science.
Harvard Business School
DataCamp is by far my favorite website to learn from.
Decision Science Analytics, USAA