Course

Handling Missing Data with Imputations in R

Skill level: Advanced
Updated 2022-10
Diagnose, visualize, and treat missing data with a range of imputation techniques, plus tips to improve your results.
R, Data Manipulation
4 hours
13 videos
49 exercises
4,200 XP
6,193
Certificate of completion

Course Description

Missing data is everywhere. The process of filling in missing values is known as imputation, and knowing how to correctly fill in missing data is an essential skill if you want to produce accurate predictions and distinguish yourself from the crowd. In this course, you’ll learn how to use visualizations and statistical tests to recognize missing data patterns and how to impute data using a collection of statistical and machine learning models. You’ll also gain decision-making skills, helping you decide which imputation method fits best in a particular situation. Finally, you’ll learn to incorporate uncertainty from imputation into your inference and predictions, making them more robust and reliable.

Prerequisites

Intermediate Regression in R
Dealing With Missing Data in R
1

The Problem of Missing Data

In this chapter, you’ll find out why missing data can be a risk when analyzing a dataset. You’ll be introduced to the three missing data mechanisms and learn how to recognize them using statistical tests and visualization tools.
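As a rough illustration of recognizing a missing data mechanism with a statistical test, the base R sketch below simulates data and then checks whether missingness in one variable is related to another observed variable. The variable names (`temp`, `humidity`), the simulated values, and the missingness rule are all assumptions made for this example; they are not taken from the course materials.

```r
set.seed(42)
n <- 500

# Simulated (hypothetical) weather readings.
temp <- rnorm(n, mean = 25, sd = 3)
humidity <- 80 - 1.5 * (temp - 25) + rnorm(n, sd = 4)

# Assumed mechanism: humidity is more likely to be missing on hot days,
# so missingness depends on an observed variable.
p_missing <- plogis(-1 + 0.8 * (temp - 25))
humidity[runif(n) < p_missing] <- NA

# Compare temperature between rows with and without a humidity reading.
is_missing <- is.na(humidity)
test <- t.test(temp[is_missing], temp[!is_missing])
test$p.value
```

A small p-value here suggests that the data are not missing completely at random, because rows with and without a humidity reading differ in temperature.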
2

Donor-Based Imputation

Get to know the taxonomy of imputation methods and learn three donor-based techniques: mean, hot-deck, and k-Nearest-Neighbors imputation. You’ll look under the hood to see how these methods work, before learning how to apply them to a real-world tropical weather dataset. Along the way, you’ll also learn useful tricks that you can use to make them work even better for your problems.
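The three donor-based ideas named above can be sketched in a few lines of base R. This is a minimal illustration on a made-up data frame, not the course's own code: the hot-deck step uses a simple random-donor variant, and the k-Nearest-Neighbors step measures distance on a single predictor with an assumed `k` of 2.

```r
set.seed(1)

# Hypothetical toy data: humidity has two gaps.
df <- data.frame(
  temp     = c(24.1, 25.3, 26.8, 27.5, 23.9, 26.1, 25.0, 27.0),
  humidity = c(82, 79, NA, 74, 84, NA, 80, 75)
)
miss <- is.na(df$humidity)

# Mean imputation: every gap receives the mean of the observed values.
mean_imp <- df$humidity
mean_imp[miss] <- mean(df$humidity, na.rm = TRUE)

# Random hot-deck: every gap receives the value of a randomly chosen donor row.
hotdeck_imp <- df$humidity
hotdeck_imp[miss] <- sample(df$humidity[!miss], sum(miss), replace = TRUE)

# kNN: every gap receives the average humidity of the k observed rows
# whose temperature is closest to that of the incomplete row.
k <- 2
knn_imp <- df$humidity
for (i in which(miss)) {
  dist_to_donors <- abs(df$temp[!miss] - df$temp[i])
  nearest <- order(dist_to_donors)[seq_len(k)]
  knn_imp[i] <- mean(df$humidity[!miss][nearest])
}

data.frame(temp = df$temp, mean_imp, hotdeck_imp, knn_imp)
```

Mean imputation gives every gap the same value, while the hot-deck and kNN variants borrow values from actual or similar observations, which tends to preserve more of the variable's spread.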
3

Model-Based Imputation

It’s time to learn how to use statistical and machine learning models, such as linear regression, logistic regression, and random forests, to impute missing data. In this chapter, you’ll look into how the models make their predictions and use this knowledge to draw the imputed values from conditional distributions. This is important as it ensures your imputations are more varied and plausible, making them more similar to the true data.
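To make the idea of drawing from a conditional distribution concrete, the sketch below contrasts plain regression predictions with draws from a normal distribution centered on those predictions. It uses simulated data and hypothetical variable names, and it assumes normally distributed residuals with constant variance; it also ignores uncertainty in the fitted coefficients, so treat it as an illustration rather than a complete method.

```r
set.seed(7)
n <- 1000

# Simulated (hypothetical) data with 300 humidity values removed at random.
temp <- rnorm(n, mean = 25, sd = 3)
humidity <- 80 - 1.5 * (temp - 25) + rnorm(n, sd = 4)
humidity[sample(n, 300)] <- NA
df <- data.frame(temp, humidity)
miss <- is.na(df$humidity)

# Fit on the complete rows (lm drops rows with NA by default).
fit <- lm(humidity ~ temp, data = df)
pred <- predict(fit, newdata = df[miss, ])

# Deterministic imputation: use the point predictions directly.
det_imp <- df$humidity
det_imp[miss] <- pred

# Stochastic imputation: draw each value from N(prediction, residual sd).
sto_imp <- df$humidity
sto_imp[miss] <- rnorm(sum(miss), mean = pred, sd = summary(fit)$sigma)

# Spread of the imputed values under each approach.
c(deterministic = sd(det_imp[miss]), stochastic = sd(sto_imp[miss]))
```

The deterministic values all lie exactly on the regression line, so they vary less than real observations would; adding residual noise restores that variability.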
4

Uncertainty from Imputation

Imputed values are not set in stone. They are just estimates, and estimates come with some uncertainty. In this final chapter, you’ll discover how bootstrapping and chained equations, using the mice package, can be used to incorporate imputation uncertainty into your models and analyses to make them more reliable and robust.
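One way to picture the bootstrapping idea is to resample the rows, impute each resample, and re-estimate the quantity of interest, so that the spread of the estimates reflects both sampling and imputation uncertainty. The base R sketch below does this for a regression slope on simulated data. The helper `impute_and_estimate()`, the variable names, and the choice of 500 replicates are assumptions for illustration; the sketch does not use the mice package.

```r
set.seed(123)
n <- 300

# Simulated (hypothetical) data with 90 humidity values removed at random.
temp <- rnorm(n, mean = 25, sd = 3)
humidity <- 80 - 1.5 * (temp - 25) + rnorm(n, sd = 4)
humidity[sample(n, 90)] <- NA
df <- data.frame(temp, humidity)

# Hypothetical helper: impute humidity by stochastic regression,
# then return the slope of humidity on temp from the completed data.
impute_and_estimate <- function(d) {
  miss <- is.na(d$humidity)
  if (any(miss)) {
    fit <- lm(humidity ~ temp, data = d)
    pred <- predict(fit, newdata = d[miss, ])
    d$humidity[miss] <- rnorm(sum(miss), mean = pred, sd = summary(fit)$sigma)
  }
  coef(lm(humidity ~ temp, data = d))[["temp"]]
}

# Bootstrap: resample rows with replacement, impute, and estimate each time.
boot_slopes <- replicate(500, {
  rows <- sample(nrow(df), replace = TRUE)
  impute_and_estimate(df[rows, ])
})

# Percentile interval for the slope.
ci <- quantile(boot_slopes, c(0.025, 0.975))
ci
```

Because the imputation is repeated inside every replicate, the resulting interval is wider than one computed from a single imputed dataset treated as if it were fully observed.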