Data Privacy and Anonymization in R

4 hr
3,650 XP

Course Description

With social media and big data everywhere, data privacy has become a growing public concern. Recognizing this issue, entities such as Google, Apple, and the US Census Bureau are promoting better privacy techniques, specifically differential privacy, a mathematical condition that quantifies privacy risk. In this course, you will learn to code basic data privacy methods and a differentially private algorithm that builds on several properties of differential privacy. With these tools in hand, you will learn how to generate a basic synthetic (fake) data set that carries a differential privacy guarantee and is suitable for public release.
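
As a rough guide to what "mathematical condition" means here, the block below states the standard textbook definition of ε-differential privacy. It is not quoted from the course page, and the symbols (a randomized mechanism M, data sets D and D' that differ in a single record, a set of possible outputs S, and the privacy parameter ε) are introduced here only for illustration. A mechanism M satisfies ε-differential privacy if, for every such D, D' and S:

```latex
\Pr[\mathcal{M}(D) \in S] \le e^{\varepsilon} \, \Pr[\mathcal{M}(D') \in S]
```

A smaller ε forces the two output distributions closer together, which corresponds to a stronger privacy guarantee.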

  1. Introduction to Data Privacy

    Free

    This chapter covers some basic data privacy techniques that statisticians use to anonymize data. You'll first learn how to remove identifiers and then generate synthetic data from probability distributions.

    Intro to anonymization (I): 50 xp
    Removing Names: 100 xp
    Rounding Salaries: 100 xp
    Intro to anonymization (II): 50 xp
    Generalization: 100 xp
    Bottom Coding: 100 xp
    summarize_at(): 100 xp
    count(): 100 xp
    Data synthesis: 50 xp
    Binomial Distribution: 100 xp
    Normal Distribution: 100 xp
  2. Introduction to Differential Privacy

    After covering the basic data privacy techniques, you'll learn the concepts behind differential privacy and how to implement the Laplace mechanism, the most common differentially private algorithm.
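
To give a feel for the Laplace mechanism mentioned above, here is a minimal sketch of how it might be applied to a counting query. The course itself teaches this in R; the sketch uses Python with only the standard library, and the function names `laplace_noise` and `private_count`, the salary values, and the choice of epsilon are all hypothetical, chosen for illustration rather than taken from the course material.

```python
import random


def laplace_noise(scale, rng):
    """Draw one sample from a Laplace(0, scale) distribution.

    The difference of two independent exponential draws with mean `scale`
    follows a Laplace distribution centered at zero with that scale.
    """
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def private_count(values, predicate, epsilon, rng):
    """Return a count plus Laplace noise with scale sensitivity / epsilon."""
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    true_count = sum(1 for v in values if predicate(v))
    # Adding or removing one record changes a count by at most 1.
    sensitivity = 1.0
    return true_count + laplace_noise(sensitivity / epsilon, rng)


def over_50k(salary):
    return salary > 50000


# Hypothetical salaries, invented purely for illustration.
salaries = [31000, 45000, 52000, 58000, 61000, 75000, 120000]

rng = random.Random(42)  # fixed seed so the sketch is reproducible
true_count = sum(1 for s in salaries if over_50k(s))
noisy_count = private_count(salaries, over_50k, epsilon=0.5, rng=rng)
print(f"true count: {true_count}, noisy count: {noisy_count:.2f}")
```

The noise scale is the query's sensitivity divided by epsilon, so a smaller epsilon (stronger privacy) yields a noisier released count. The released value is generally not an integer and can even be negative; whether to round or clip it afterwards is a separate design choice that this sketch leaves out.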

Collaborators: Chester Ismay, Sumedh Panchadhar

Claire Bowen

Postdoctoral Researcher at the Los Alamos National Laboratory

Claire McKay Bowen is a Postdoctoral Researcher in the Statistical Science Group at the Los Alamos National Laboratory. She conducts research in uncertainty quantification with physics-informed Bayesian model updating and in data privacy via differentially private data synthesis methods. Her other interests include statistical computing, scientific communication, and STEM outreach.