Skip to main content

Human Resources Analytics: Predicting Employee Churn in R

Predict employee turnover and design retention strategies.

Start Course for Free
4 Hours14 Videos50 Exercises3,663 Learners4000 XP

Create Your Free Account



By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA. You confirm you are at least 16 years old (13 if you are an authorized Classrooms user).

Loved by learners at thousands of companies

Course Description

Organizational growth largely depends on staff retention. Losing employees frequently impacts the morale of the organization and hiring new employees is more expensive than retaining existing ones. Good news is that organizations can increase employee retention using data-driven intervention strategies. This course focuses on data acquisition from multiple HR sources, exploring and deriving new features, building and validating a logistic regression model, and finally, show how to calculate ROI for a potential retention strategy.

  1. 1



    This chapter begins with a general introduction to employee churn/turnover and reasons for turnover as shared by employees. You will learn how to calculate turnover rate and explore turnover rate across different dimensions. You will also identify talent segments for your analysis and bring together relevant data from multiple HR data sources to derive more useful insights.

    Play Chapter Now
    What is turnover?
    50 xp
    Objectives of employee turnover prediction
    50 xp
    Importing headcount and turnover data
    100 xp
    Exploring the data
    50 xp
    What proportion of employees have left?
    100 xp
    Which levels have high turnover rate?
    100 xp
    Is turnover rate different across locations?
    100 xp
    HR data architecture
    50 xp
    Filtering the dataset
    100 xp
    Combining HR datasets (I)
    100 xp
    Combining HR datasets (II)
    100 xp
    Master data overview
    100 xp
  2. 2

    Feature Engineering

    In this chapter, you will create new variables from existing data to explain employee turnover. You will analyze compensation data and create compa-ratio to measure pay equity of all employees. To identify the most important variables influencing turnover, you will use the concept of Information Value (IV).

    Play Chapter Now
  3. 4

    Model Validation, HR Interventions, and ROI

    In this chapter, you will calculate the accuracy of your model and categorize employees into specific risk buckets. You will then formulate an intervention strategy and calculate the ROI for this strategy.

    Play Chapter Now


Employee data


sumedhpanchadharSumedh PanchadharsaschaSascha Mayr
Anurag Gupta Headshot

Anurag Gupta

People Analytics Practitioner

Anurag holds post graduate degree in Human Resources Management from XLRI, Jamshedpur. He has several years of experience in setting up People Analytics function for global MNCs and believes that for HR to get a seat at the C-suite table, they should incorporate data driven decisions across the employee lifecycle.
See More
Abhishek Trehan Headshot

Abhishek Trehan

People Analytics Practitioner

Abhishek is an SHRM Certified Professional, a CAP® (Certified Analytics Professional from INFORMS), and a Six Sigma Black Belt. Has worked and held various roles in fortune 500 companies including setting up HR Analytics and Digital HR COE. He has been a visiting faculty to Indian Institute of Management and is a speaker on HR Analytics.
See More

What do other learners have to say?

I've used other sites—Coursera, Udacity, things like that—but DataCamp's been the one that I've stuck with.

Devon Edwards Joseph
Lloyds Banking Group

DataCamp is the top resource I recommend for learning data science.

Louis Maiden
Harvard Business School

DataCamp is by far my favorite website to learn from.

Ronald Bowers
Decision Science Analytics, USAA