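Course

Practicing Machine Learning Interview Questions in Python

Skill level: Advanced

Updated September 2022

Sharpen your knowledge and prepare for your next interview by practicing Python machine learning interview questions.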

Python, Machine Learning

4 hours

16 videos

60 exercises

4,600 XP

12,158

Statement of Accomplishment

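Course Description

How should you prepare for a Machine Learning interview? In this course, you'll prepare answers to 15 common Python-based Machine Learning (ML) interview questions for a data scientist role.

The questions cover seven key topics: data preprocessing, data visualization, supervised learning, unsupervised learning, ensembling, model selection, and model evaluation.

You'll start with questions on data preprocessing and data visualization. After performing all the required preprocessing steps, you'll build a predictive ML model to sharpen your practical skills.

Next, you'll cover supervised learning techniques before moving on to unsupervised learning. Depending on the role, a Machine Learning interview will often cover both topics.

Finally, you'll wrap up with model selection and evaluation: you'll learn how to evaluate a model's generalization performance and build ensemble models while exploring a variety of techniques.

By the end of this course, you'll have the theoretical background you need and be able to answer all 15 questions successfully in Python code.

The coding examples mainly use the scikit-learn package, which is easy to use and broadly supports the most important Machine Learning techniques in Python.

This course does not teach basic Machine Learning theory. That material is covered in the prerequisite courses.

Prerequisites

Unsupervised Learning in Python

Supervised Learning with scikit-learn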

1

Data Pre-processing and Visualization

In the first chapter of this course, you'll perform all the preprocessing steps required to create a predictive machine learning model, including handling missing values and outliers and normalizing your dataset.

Handling missing data

The hunt for missing values

Simple imputation

Iterative imputation

Data distributions and transformations

Training vs test set distributions and transformations

Train/test distributions

Log and power transformations

Data outliers and scaling

Outlier detection

Handling outliers

Z-score standardization
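
As a rough illustration of the preprocessing topics in this chapter, the sketch below imputes missing values, standardizes features with z-scores, and flags potential outliers. It is not taken from the course: the toy data, the median strategy, and the |z| > 3 outlier threshold are all assumptions made for the example.

```python
import numpy as np
from sklearn.impute import SimpleImputer
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# Hypothetical toy data: 100 rows, 2 numeric features, some values missing.
rng = np.random.default_rng(0)
X = rng.normal(loc=50.0, scale=10.0, size=(100, 2))
X[::10, 0] = np.nan  # knock out every 10th value in the first column

X_train, X_test = train_test_split(X, test_size=0.2, random_state=0)

# Fit the imputer and scaler on the training set only, then reuse the
# fitted statistics on the test set to avoid leaking test information.
imputer = SimpleImputer(strategy="median")
scaler = StandardScaler()

X_train_prepared = scaler.fit_transform(imputer.fit_transform(X_train))
X_test_prepared = scaler.transform(imputer.transform(X_test))

# After standardization each value is a z-score, so rows with any
# |z| > 3 can be flagged as potential outliers (assumed threshold).
outlier_rows = np.where((np.abs(X_train_prepared) > 3).any(axis=1))[0]
```

Fitting on the training split and only transforming the test split is the design choice worth being ready to justify in an interview.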

2

Supervised Learning

In the second chapter of this course, you'll practice several different aspects of supervised machine learning: selecting the optimal feature subset, using regularization to avoid overfitting, engineering features, and building ensemble models to address the bias-variance trade-off.

Regression: feature selection

Best feature subset

Filter and wrapper methods

Feature selection through feature importance

Regression: regularization

Avoiding overfitting

Lasso regularization

Ridge regularization

Classification: feature engineering

Classification model features

Logistic regression baseline classifier

Ensemble methods

Bootstrap aggregation (bagging)
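
To make the regularization lessons in this chapter concrete, here is a minimal sketch contrasting Lasso (L1) and Ridge (L2) on synthetic data. The dataset and the `alpha` values are assumptions chosen for illustration, not the course's own exercise.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import Lasso, Ridge

# Synthetic regression problem: 10 features, only 3 of them informative.
X, y = make_regression(
    n_samples=200, n_features=10, n_informative=3, noise=5.0, random_state=0
)

# alpha controls regularization strength (illustrative values).
lasso = Lasso(alpha=1.0).fit(X, y)
ridge = Ridge(alpha=1.0).fit(X, y)

# The L1 penalty can drive coefficients exactly to zero (implicit feature
# selection); the L2 penalty only shrinks them toward zero.
n_zero_lasso = int(np.sum(lasso.coef_ == 0))
n_zero_ridge = int(np.sum(ridge.coef_ == 0))
print("zeroed by Lasso:", n_zero_lasso, "| zeroed by Ridge:", n_zero_ridge)
```

Counting exact zeros is one simple way to show why Lasso doubles as a feature selection method while Ridge does not.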

3

Unsupervised Learning

In the third chapter of this course, you'll use unsupervised learning to apply feature extraction and visualization techniques for dimensionality reduction, and use clustering methods to select both an appropriate clustering algorithm and the optimal number of clusters for a dataset.

Dimensionality reduction: feature extraction

The curse of dimensionality

Principal component analysis

Singular value decomposition

Dimensionality reduction: visualization techniques

Reducing high-dimensional data

Visualizing separation of classes with PCA I

Visualizing PCs with a scree plot

Clustering analysis: selecting the right clustering algorithm

Clustering algorithms

K-means clustering

Hierarchical agglomerative clustering

Clustering analysis: choosing the optimal number of clusters

What is the optimal k?

Silhouette method

Elbow method
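
The dimensionality reduction and cluster-count lessons above can be sketched roughly as follows. The blob data and the candidate range of k are assumptions for the example; the inertia values are collected only as the raw material for an elbow plot.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.decomposition import PCA
from sklearn.metrics import silhouette_score

# Hypothetical data: 300 points in 5 dimensions drawn around 3 centers.
X, _ = make_blobs(
    n_samples=300, centers=3, n_features=5, cluster_std=1.0, random_state=0
)

# PCA: project to 2 components for plotting; the explained variance
# ratios are the values a scree plot would display.
pca = PCA(n_components=2)
X_2d = pca.fit_transform(X)
explained = pca.explained_variance_ratio_

# Fit K-means for several k and record both criteria.
silhouettes = {}
inertias = {}
for k in range(2, 7):
    km = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
    silhouettes[k] = silhouette_score(X, km.labels_)  # higher is better
    inertias[k] = km.inertia_  # plot against k and look for the "elbow"

# Silhouette method: pick the k with the highest mean silhouette score.
best_k = max(silhouettes, key=silhouettes.get)
print("k with the highest silhouette score:", best_k)
```

The silhouette method yields a single number to maximize, whereas the elbow method relies on visually judging where the inertia curve flattens, which is why the two are often reported together.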

4

Model Selection and Evaluation

In the fourth and final chapter of this course, you'll step it up: you'll apply bootstrapping and cross-validation to evaluate how well a model generalizes, apply resampling techniques to imbalanced classes, detect and remove multicollinearity, and build an ensemble model.

Model generalization: bootstrapping and cross-validation

Validating model performance

Decision tree

A forest of decision trees

Model evaluation: imbalanced classification models

X-ray weapon detection

Imbalanced class metrics

Resampling techniques

Model selection: regression models

Addressing multicollinearity

Multicollinearity techniques - feature engineering

Multicollinearity techniques - PCA

Model selection: ensemble models

Random forest vs gradient boosting

Random forest ensemble

Gradient boosting ensemble
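
As one possible way to tie this chapter's themes together, the sketch below compares a random forest and a gradient boosting classifier with stratified cross-validation on an imbalanced dataset. The synthetic data, the 90/10 class split, the F1 metric, and all hyperparameters are assumptions for illustration rather than the course's own setup.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.model_selection import StratifiedKFold, cross_val_score

# Hypothetical imbalanced binary problem: roughly 90% negatives, 10% positives.
X, y = make_classification(
    n_samples=500, n_features=10, weights=[0.9, 0.1], random_state=0
)

# Stratified folds keep the class ratio similar in every split.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)

models = {
    "random_forest": RandomForestClassifier(
        n_estimators=100, class_weight="balanced", random_state=0
    ),
    "gradient_boosting": GradientBoostingClassifier(random_state=0),
}

# Accuracy can look deceptively high on imbalanced classes, so this
# comparison scores the positive class with F1 instead.
mean_f1 = {
    name: cross_val_score(model, X, y, cv=cv, scoring="f1").mean()
    for name, model in models.items()
}
for name, score in mean_f1.items():
    print(f"{name}: mean F1 = {score:.3f}")
```

Random forests average many independently grown trees (bagging), while gradient boosting adds trees sequentially to correct earlier errors, so the cross-validated comparison is an empirical check rather than a foregone conclusion.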
