강의
R에서 대치(Imputation)로 결측치 다루기
고급기술 수준
업데이트됨 2022. 10.
RData Manipulation4시간13 동영상49 연습 문제4,200 XP6,212성취 증명서
무료 계정 만들기
Google에서 계속 진행더 많은 옵션 보기또는
수천 개 기업의 학습자들이 사랑하는
팀을 교육하시나요?
비즈니스용으로 체험해 보세요강의 설명
선수 조건
Intermediate Regression in RDealing With Missing Data in R1
The Problem of Missing Data
In this chapter, you’ll find out why missing data can be a risk when analyzing a dataset. You’ll be introduced to the three missing data mechanisms and learn how to recognize them using statistical tests and visualization tools.
2
Donor-Based Imputation
Get to know the taxonomy of imputation methods and learn three donor-based techniques: mean, hot-deck, and k-Nearest-Neighbors imputation. You’ll look under the hood to see how these methods work, before learning how to apply them to a real-world tropical weather dataset. Along the way, you’ll also learn useful tricks that you can use to make them work even better for your problems.
3
Model-Based Imputation
It’s time to learn how to use statistical and machine learning models, such as linear regression, logistic regression, and random forests, to impute missing data. In this chapter, you’ll look into how the models make their predictions and use this knowledge to draw the imputed values from conditional distributions. This is important as it ensures your imputations are more varied and plausible, making them more similar to the true data.
4
Uncertainty from Imputation
Imputed values are not set in stone. They are just estimates and estimates come with some uncertainty. In this final chapter, you’ll discover how bootstrapping and chained equation using the mice package can be used to incorporate imputation uncertainty into your models and analyses to make them more reliable and robust.
R에서 대치(Imputation)로 결측치 다루기
강의 완료
19백만 명 이상의 학습자와 함께 R에서 대치(Imputation)로 결측치 다루기을(를) 시작하세요!
무료 계정 만들기
Google에서 계속 진행더 많은 옵션 보기또는
DataCamp for Mobile을 통해 데이터 분석 능력을 향상시키세요.
모바일 강좌와 매일 5분 코딩 챌린지를 통해 이동 중에도 학습 효과를 높이세요.