Skip to main content

Saloni Bhandari has completed

Introduction to Feature Engineering in R

Start course For Free
4 hr
3,500 XP
Statement of Accomplishment Badge

Loved by learners at thousands of companies


Course Description

Feature engineering helps you uncover useful insights from your machine learning models. The model building process is iterative and requires creating new features using existing variables that make your model more efficient. In this course, you will explore different data sets and apply a variety of feature engineering techniques to both continuous and discrete variables.
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.
DataCamp for BusinessFor a bespoke solution book a demo.
  1. 1

    Creating Features from Categorical Data

    Free

    In this chapter, you will learn how to change categorical features into numerical representations that models can interpret. You'll learn about one-hot encoding and using binning for categorical features.

    Play Chapter Now
    Introduction to feature engineering in R
    50 xp
    Examples of feature engineering
    50 xp
    One-hot encoding
    100 xp
    Binning encoding: content driven
    50 xp
    Leveraging content knowledge
    100 xp
    Converting new categories to numeric
    100 xp
    Binning encoding: data driven
    50 xp
    Categorical proportions by outcome
    100 xp
    Reducing categories using outcome
    100 xp
  2. 2

    Creating Features from Numeric Data

    In this chapter, you will learn how to manipulate numerical features to create meaningful features that can give better insights into your model. You will also learn how to work with dates in the context of feature engineering.

    Play Chapter Now
  3. 3

    Transforming Numerical Features

    In this chapter, you will learn about using transformation techniques, like Box-Cox and Yeo-Johnson, to address issues with non-normally distributed features. You'll also learn about methods to scale features, including mean centering and z-score standardization.

    Play Chapter Now
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.

collaborators

Collaborator's avatar
Chester Ismay
Collaborator's avatar
Amy Peterson

prerequisites

Exploratory Data Analysis in R
Jose Hernandez HeadshotJose Hernandez

Data Scientist, University of Washington

See More

Join over 18 million learners and start Introduction to Feature Engineering in R today!

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.