본문으로 바로가기

강의

Python으로 배우는 선형 모델 입문

중급기술 수준

업데이트됨 2024. 8.

Python으로 선형 모델의 개념과 활용을 배우고, 데이터 패턴을 설명·예측·통찰 도출을 위한 모델을 구축하세요.

무료로 강의 시작

PythonProbability & Statistics

4시간

16 동영상

59 연습 문제

5,050 XP

26,710

성취 증명서

수천 개 기업의 학습자들이 사랑하는

팀을 교육하시나요?

비즈니스용으로 체험해 보세요

강의 설명

과학자의 주요 목표 중 하나는 데이터에서 패턴을 찾고, 그 패턴을 설명·예측·통찰로 이어지는 모델을 만드는 것입니다. 이 가운데 가장 기초적인 패턴이 두 변수 간의 선형 관계입니다. 이 강의는 최소제곱, 선형 회귀, 추정, 부트스트랩 리샘플링과 같은 기법을 통해 데이터의 선형 관계를 탐색하고 정량화하며 모델링하는 방법을 소개합니다. 여기서는 Python 데이터 사이언스 생태계의 강력한 도구인 scipy, statsmodels, scikit-learn을 활용해 선형 모델을 구축하고 평가해 볼 거예요. Python으로 선형 모델의 개념과 활용을 익히면서, 실무형 모델링 입문이자 통계와 Machine Learning의 고급 기법을 학습하기 위한 탄탄한 기반을 마련합니다.

선수 조건

Introduction to Regression with statsmodels in Python

1

Exploring Linear Trends

We start the course with an initial exploration of linear relationships, including some motivating examples of how linear models are used, and demonstrations of data visualization methods from matplotlib. We then use descriptive statistics to quantify the shape of our data and use correlation to quantify the strength of linear relationships between two variables.

Introduction to Modeling Data

Reasons for Modeling: Interpolation

Reasons for Modeling: Extrapolation

Reasons for Modeling: Estimating Relationships

Visualizing Linear Relationships

Plotting the Data

Plotting the Model on the Data

Visually Estimating the Slope & Intercept

Quantifying Linear Relationships

Mean, Deviation, & Standard Deviation

Covariance vs Correlation

Correlation Strength

2

Building Linear Models

Here we look at the parts that go into building a linear model. Using the concept of a Taylor Series, we focus on the parameters slope and intercept, how they define the model, and how to interpret the them in several applied contexts. We apply a variety of python modules to find the model that best fits the data, by computing the optimal values of slope and intercept, using least-squares, numpy, statsmodels, and scikit-learn.

What makes a model linear

Terms in a Model

Model Components

Model Parameters

Interpreting Slope and Intercept

Linear Proportionality

Slope and Rates-of-Change

Intercept and Starting Points

Model Optimization

Residual Sum of the Squares

Minimizing the Residuals

Visualizing the RSS Minima

Least-Squares Optimization

Least-Squares with `numpy`

Optimization with Scipy

Least-Squares with `statsmodels`

3

Making Model Predictions

Next we will apply models to real data and make predictions. We will explore some of the most common pit-falls and limitations of predictions, and we evaluate and compare models by quantifying and contrasting several measures of goodness-of-fit, including RMSE and R-squared.

Modeling Real Data

Linear Model in Anthropology

Linear Model in Oceanography

Linear Model in Cosmology

The Limits of Prediction

Interpolation: Inbetween Times

Extrapolation: Going Over the Edge

Goodness-of-Fit

RMSE Step-by-step

Standard Error

Variation Around the Trend

Variation in Two Parts

4

Estimating Model Parameters

In our final chapter, we introduce concepts from inferential statistics, and use them to explore how maximum likelihood estimation and bootstrap resampling can be used to estimate linear model parameters. We then apply these methods to make probabilistic statements about our confidence in the model parameters.

Inferential Statistics Concepts

Sample Statistics versus Population

Variation in Sample Statistics

Visualizing Variation of a Statistic

Model Estimation and Likelihood

Estimation of Population Parameters

Maximizing Likelihood, Part 1

Maximizing Likelihood, Part 2

Model Uncertainty and Sample Distributions

Bootstrap and Standard Error

Estimating Speed and Confidence

Visualize the Bootstrap

Model Errors and Randomness

Test Statistics and Effect Size

Null Hypothesis

Visualizing Test Statistics

Visualizing the P-Value

Course Conclusion

Python으로 배우는 선형 모델 입문

강의
완료

수료증 획득

LinkedIn 프로필, 이력서 또는 CV에 이 인증서를 추가하세요
소셜 미디어와 성과 평가에서 공유하세요지금 등록

19백만 명 이상의 학습자와 함께 Python으로 배우는 선형 모델 입문을(를) 시작하세요!

DataCamp for Mobile을 통해 데이터 분석 능력을 향상시키세요.

모바일 강좌와 매일 5분 코딩 챌린지를 통해 이동 중에도 학습 효과를 높이세요.