Cleaning Data in Python
Skill level: Intermediate
Updated December 2025 · Python · Data Preparation · 4 hours · 13 videos · 44 exercises · 3,500 XP · 150K+ · Statement of Accomplishment
Prerequisites
Python Toolbox
Joining Data with pandas
1
Common data problems
In this chapter, you'll learn how to overcome some of the most common dirty data problems. You'll convert data types, apply range constraints to remove future data points, and remove duplicated data points to avoid double-counting.
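The three fixes named in this chapter can be sketched with pandas as follows. This is a minimal illustration, not the course's own exercise code: the `rides` table, its column names, and the fixed reference date are all assumptions made up for the example.

```python
import pandas as pd

# Hypothetical ride records; the column names and values are illustrative only.
rides = pd.DataFrame({
    "ride_id": [1, 2, 2, 3],
    "duration": ["12 minutes", "8 minutes", "8 minutes", "30 minutes"],
    "ride_date": ["2024-01-05", "2024-02-10", "2024-02-10", "2031-07-01"],
})

# 1. Convert data types: strip the unit text, cast to integer,
#    and parse the date strings into datetimes.
rides["duration"] = (
    rides["duration"].str.replace(" minutes", "", regex=False).astype("int64")
)
rides["ride_date"] = pd.to_datetime(rides["ride_date"])

# 2. Range constraint: drop rows dated after a reference date.
#    The date is fixed here (an assumption) so the example is reproducible;
#    real code would more likely use pd.Timestamp.today().
reference_date = pd.Timestamp("2025-01-01")
rides = rides[rides["ride_date"] <= reference_date]

# 3. Remove exact duplicate rows to avoid double-counting.
rides = rides.drop_duplicates().reset_index(drop=True)

print(rides)
```

Dropping out-of-range rows is only one way to handle them; capping them at the limit or treating them as missing are alternatives, depending on the analysis.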
2
Text and categorical data problems
Categorical and text data can often be some of the messiest parts of a dataset due to their unstructured nature. In this chapter, you’ll learn how to fix whitespace and capitalization inconsistencies in category labels, collapse multiple categories into one, and reformat strings for consistency.
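A short pandas sketch of these text fixes is shown below. The `survey` table, its labels, and the mapping from "single" to "unmarried" are hypothetical and chosen only to show each step.

```python
import pandas as pd

# Hypothetical survey data; labels and phone formats are illustrative only.
survey = pd.DataFrame({
    "marriage_status": [" married", "MARRIED", "unmarried ", "Single"],
    "phone": ["555-0100", "555 0101", "(555) 0102", "5550103"],
})

# Fix whitespace and capitalization inconsistencies in the category labels.
survey["marriage_status"] = survey["marriage_status"].str.strip().str.lower()

# Collapse multiple categories into one (assumed mapping for this example).
survey["marriage_status"] = survey["marriage_status"].replace({"single": "unmarried"})
survey["marriage_status"] = survey["marriage_status"].astype("category")

# Reformat strings for consistency: keep digits only.
survey["phone"] = survey["phone"].str.replace(r"\D+", "", regex=True)

print(survey)
```

Normalizing the labels before converting to the `category` dtype keeps variants such as " married" and "MARRIED" from becoming separate categories.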
3
Advanced data problems
In this chapter, you'll dive into more advanced data cleaning problems, such as ensuring that weights are all written in kilograms instead of pounds. You'll also gain invaluable skills that will help you verify that values have been added correctly, and that missing values don't negatively impact your analyses.
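The three checks described here (unit uniformity, cross-field validation, and missing values) might look like the sketch below. The `patients` and `flights` tables and their columns are hypothetical, and filling with the mean is just one possible imputation choice, not necessarily the one the course recommends.

```python
import pandas as pd

LB_TO_KG = 0.45359237  # kilograms per pound

# Hypothetical data: weights recorded in mixed units, one missing heart rate.
patients = pd.DataFrame({
    "weight": [70.0, 154.0, 82.5],
    "unit": ["kg", "lb", "kg"],
    "resting_hr": [61.0, None, 72.0],
})

# Unit uniformity: convert only the rows recorded in pounds.
is_lb = patients["unit"] == "lb"
patients.loc[is_lb, "weight"] = patients.loc[is_lb, "weight"] * LB_TO_KG
patients["unit"] = "kg"

# Cross-field validation: check that the parts add up to the stated total.
flights = pd.DataFrame({
    "economy": [100, 80, 60],
    "business": [20, 10, 5],
    "total": [120, 90, 70],
})
sums_match = flights[["economy", "business"]].sum(axis=1) == flights["total"]
inconsistent = flights[~sums_match]

# Missing values: count them, then impute with the column mean (an assumption).
n_missing = int(patients["resting_hr"].isna().sum())
patients["resting_hr"] = patients["resting_hr"].fillna(patients["resting_hr"].mean())

print(patients)
print(inconsistent)
```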
4
Record linkage
Record linkage is a powerful technique for merging multiple datasets when values have typos or different spellings. In this chapter, you'll learn how to link records by calculating the similarity between strings. You'll then use your new skills to join two restaurant review datasets into one clean master dataset.
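As a rough illustration of linking records by string similarity, the sketch below uses `difflib.SequenceMatcher` from the Python standard library. That is a stand-in chosen to keep the example dependency-free, not necessarily the tooling the course uses. The restaurant names and the 0.8 threshold are assumptions.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a case-insensitive similarity ratio between 0 and 1."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


# Hypothetical restaurant names from two review datasets.
restaurants_a = ["Arnie Morton's of Chicago", "Cafe Bizou", "Campanile"]
restaurants_b = ["arnie mortons of chicago", "Cafe Bizu", "Grill on the Alley"]

THRESHOLD = 0.8  # assumed cutoff; tune it on real data

# For each name in the first list, find its most similar name in the second
# and keep the pair only if the similarity clears the threshold.
matches = []
for name_a in restaurants_a:
    best = max(restaurants_b, key=lambda name_b: similarity(name_a, name_b))
    if similarity(name_a, best) >= THRESHOLD:
        matches.append((name_a, best))

for name_a, name_b in matches:
    print(f"{name_a!r} <-> {name_b!r}")
```

Comparing every pair of names scales quadratically, so on larger datasets it is common to first restrict comparisons to candidate pairs that share a field such as city or cuisine type.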