Nonlinear Modeling in R with GAMs

GAMs model relationships in data as nonlinear functions that are highly adaptable to different types of data science problems.
Start Course for Free
4 Hours15 Videos50 Exercises4,233 Learners
4050 XP

Create Your Free Account

By continuing you accept the Terms of Use and Privacy Policy. You also accept that you are aware that your data will be stored outside of the EU and that you are above the age of 16.

Loved by learners at thousands of companies

Course Description

Generalized Additive Models are a powerful tool for both prediction and inference. More flexible than linear models, and more understandable than black-box methods, GAMs model relationships in data as nonlinear functions that are highly adaptable to different types of data and data science problems. In this course, you'll learn how GAMs work and how to construct them with the popular <strong>mgcv</strong> package. You'll learn how to interpret, explain and visualize your model results, and how to diagnose and fix model problems. You'll work with data sets that will show you how to apply GAMs to a variety of situations: automobile performance data for building mixed linear and nonlinear models, soil pollution data for building geospatial models, and consumer purchasing data for classification and prediction. By the end of this course, you'll have a toolbox for solving many data science problems.

  1. 1

    Introduction to Generalized Additive Models

    In this chapter, you will learn how Generalized additive models work and how to use flexible, nonlinear functions to model data without over-fitting. You will learn to use the gam() function in the mgcv package, and how to build multivariate models that mix nonlinear, linear, and categorical effects to data.
    Play Chapter Now
  2. 2

    Interpreting and Visualizing GAMs

    In this chapter, you will take a closer look at the models you fit in chapter 1 and learn how to interpret and explain them. You will learn how to make plots that show how different variables affect model outcomes. Then you will diagnose problems in models arising from under-fitting the data or hidden relationships between variables, and how to iteratively fix those problems and get better results.
    Play Chapter Now
  3. 3

    Spatial GAMs and Interactions

    In this chapter, you will extend the types of models you can fit to those with interactions of multiple variables. You will fit models of geospatial data by using these interactions to model complex surfaces, and visualize those surfaces in 3D. Then you will learn about interactions between smooth and categorical variables, and how to model interactions between very different variables like space and time.
    Play Chapter Now
  4. 4

    Logistic GAMs for Classification

    In the first three chapters, you used GAMs for regression of continuous outcomes. In this chapter, you will use GAMs for classification. You will build logistic GAMs to predict binary outcomes like customer purchasing behavior, learn to visualize this new type of model, make predictions, and learn how to explain the variables that influence each prediction.
    Play Chapter Now
Insurance (csale) data
Sumedh PanchadharRasmus BååthChester IsmayEunkyung Park
DataCamp Content Creator Headshot

DataCamp Content Creator

Course Instructor
DataCamp offers interactive R, Python, Spreadsheets, SQL and shell courses. All on topics in data science, statistics, and machine learning. Learn from a team of expert teachers in the comfort of your browser with video lessons and fun coding challenges and projects.
See More

What do other learners have to say?

I've used other sites—Coursera, Udacity, things like that—but DataCamp's been the one that I've stuck with.

Devon Edwards Joseph
Lloyds Banking Group

DataCamp is the top resource I recommend for learning data science.

Louis Maiden
Harvard Business School

DataCamp is by far my favorite website to learn from.

Ronald Bowers
Decision Science Analytics, USAA