Introduction to Feature Engineering in R
4 hours · 3,500 XP

Course Description
Feature engineering helps your machine learning models uncover useful patterns in your data. Model building is an iterative process, and creating new features from existing variables can make your models more effective. In this course, you will explore different datasets and apply a variety of feature engineering techniques to both continuous and discrete variables.
Creating Features from Categorical Data
(Free) In this chapter, you will learn how to convert categorical features into numerical representations that models can interpret, covering one-hot encoding and binning of categorical features.
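As a taste of what one-hot encoding does, here is a minimal base-R sketch; the data frame `df` and its `color` column are hypothetical example data, not taken from the course:

```r
# Hypothetical example data: a single categorical feature
df <- data.frame(color = c("red", "green", "blue", "red"))

# model.matrix() expands a factor into one 0/1 indicator column per level;
# the `- 1` drops the intercept so every level gets its own column
encoded <- model.matrix(~ color - 1, data = df)
encoded
```

Each row of `encoded` has a 1 in the column matching its original category and 0 elsewhere, which is exactly the representation most models require.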
- Introduction to feature engineering in R (50 XP)
- Examples of feature engineering (50 XP)
- One-hot encoding (100 XP)
- Binning encoding: content driven (50 XP)
- Leveraging content knowledge (100 XP)
- Converting new categories to numeric (100 XP)
- Binning encoding: data driven (50 XP)
- Categorical proportions by outcome (100 XP)
- Reducing categories using outcome (100 XP)
Creating Features from Numeric Data
In this chapter, you will learn how to transform numerical variables into meaningful features that give your model better signal. You will also learn how to work with dates in the context of feature engineering.
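Quantile-based binning, one of the techniques covered here, can be sketched in base R; the `income` vector below is hypothetical example data:

```r
# Hypothetical skewed numeric feature
set.seed(1)
income <- rlnorm(100, meanlog = 10, sdlog = 1)

# cut() with quantile() breaks creates buckets holding roughly equal
# numbers of observations, regardless of how skewed the data is
buckets <- cut(income,
               breaks = quantile(income, probs = seq(0, 1, by = 0.25)),
               include.lowest = TRUE,
               labels = c("Q1", "Q2", "Q3", "Q4"))
table(buckets)  # roughly 25 observations per bucket
```

Quantile breaks are a common choice for skewed distributions because equal-width buckets would leave some bins nearly empty.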
- Numerical bucketing or binning (50 XP)
- Visualizing the distribution (100 XP)
- Creating uniform buckets from a distribution (100 XP)
- Binning numerical data using quantiles (50 XP)
- Balanced bucketing (100 XP)
- Full matrix encoding (100 XP)
- Unique attributes of adaptive bucketing (50 XP)
- Date and time feature extraction (50 XP)
- Converting string types to date types (100 XP)
- Converting dates (100 XP)
- Visualize time features (100 XP)
Transforming Numerical Features
In this chapter, you will learn to use transformation techniques, such as Box-Cox and Yeo-Johnson, to address non-normally distributed features. You'll also learn methods to scale features, including mean centering and z-score standardization.
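Mean centering and z-score standardization can be sketched with caret's `preProcess()`, which the later lessons use; the data frame `df` is hypothetical example data:

```r
library(caret)

# Hypothetical numeric feature
df <- data.frame(x = c(10, 20, 30, 40, 50))

# "center" subtracts the mean; adding "scale" also divides by the
# standard deviation, producing z-scores.
# preProcess() likewise accepts method = "BoxCox" or "YeoJohnson"
# for the transformations discussed in this chapter.
pp <- preProcess(df, method = c("center", "scale"))
scaled <- predict(pp, df)
scaled$x  # equivalent to (df$x - mean(df$x)) / sd(df$x)
```

Fitting the preprocessing object separately from applying it means the same means and standard deviations learned from training data can later be applied to new data.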
- Box and Yeo transformations (50 XP)
- Box-Cox vs. Yeo-Johnson (50 XP)
- Box-Cox transformations (100 XP)
- Yeo-Johnson transformations (100 XP)
- Normalization techniques (50 XP)
- Scaling (100 XP)
- Mean centering (100 XP)
- Caret mean centering (100 XP)
- Z-score standardization (50 XP)
- Standardization one variable case (100 XP)
- Caret standardization (100 XP)
Advanced Methods
In the final chapter, you will use feature crossing to create features from two or more variables. You will also learn about principal component analysis and methods to explore and visualize its results.
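The PCA workflow can be sketched with base R's `prcomp()`; this example uses the built-in `mtcars` data, while the course's own datasets may differ:

```r
# PCA on the built-in mtcars data; centering and scaling put all
# variables on a comparable footing before components are extracted
pca <- prcomp(mtcars, center = TRUE, scale. = TRUE)

summary(pca)                    # proportion of variance per component
screeplot(pca, type = "lines")  # scree plot of component variances
head(pca$x[, 1:2])              # observations projected onto PC1 and PC2
```

The scree plot and the cumulative proportion of variance in `summary(pca)` are the usual tools for deciding how many components to keep.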
- Feature crossing (50 XP)
- How many features to expect (50 XP)
- Exploring features visually (100 XP)
- Exploring potential crosses (100 XP)
- Crossing two categorical features (100 XP)
- Principal component analysis (50 XP)
- Conduct PCA (100 XP)
- PCA results (50 XP)
- Interpreting PCA output (50 XP)
- Proportion of variance by PCA (100 XP)
- Visualizing results with a scree plot (100 XP)
- Visualizing components (100 XP)
- Wrap-up (50 XP)


Prerequisites
Exploratory Data Analysis in R