Перейти к основному содержимому

Course

Winning a Kaggle Competition in Python

ПередовойУровень мастерства

Обновлено 05.2026

Learn how to approach and win competitions on Kaggle.

Начать Курс Бесплатно

PythonMachine Learning4 ч16 videos52 Exercises4,200 XP21,452Свидетельство о достижениях

Создайте бесплатный аккаунт

или

Продолжая, вы принимаете наши Условия использования, нашу Политику конфиденциальности и подтверждаете, что ваши данные хранятся в США.

Пользуется популярностью среди обучающихся в тысячах компаний.

Обучение двух или более человек?

Попробуйте DataCamp for Business

Описание курса

Kaggle is the most famous platform for Data Science competitions. Taking part in such competitions allows you to work with real-world datasets, explore various machine learning problems, compete with other participants and, finally, get invaluable hands-on experience. In this course, you will learn how to approach and structure any Data Science competition. You will be able to select the correct local validation scheme and to avoid overfitting. Moreover, you will master advanced feature engineering together with model ensembling approaches. All these techniques will be practiced on Kaggle competitions datasets.

Предварительные требования

Extreme Gradient Boosting with XGBoost

1

Kaggle competitions process

In this first chapter, you will get exposure to the Kaggle competition process. You will train a model and prepare a csv file ready for submission. You will learn the difference between Public and Private test splits, and how to prevent overfitting.

Competitions overview

Explore train data

Explore test data

Prepare your first submission

Determine a problem type

Train a simple model

Prepare a submission

Public vs Private leaderboard

What model is overfitting?

Train XGBoost models

Explore overfitting XGBoost

Начало Главы

2

Dive into the Competition

Now that you know the basics of Kaggle competitions, you will learn how to study the specific problem at hand. You will practice EDA and get to establish correct local validation strategies. You will also learn about data leakage.

Understand the problem

Understand the problem type

Define a competition metric

Initial EDA

EDA statistics

EDA plots I

EDA plots II

Local validation

K-fold cross-validation

Stratified K-fold

Validation usage

Time K-fold

Overall validation score

Начало Главы

3

Feature Engineering

You will now get exposure to different types of features. You will modify existing features and create new ones. Also, you will treat the missing data accordingly.

Feature engineering

Arithmetical features

Date features

Categorical features

Label encoding

One-Hot encoding

Target encoding

Mean target encoding

K-fold cross-validation

Beyond binary classification

Missing data

Find missing data

Impute missing data

Начало Главы

4

Modeling

Time to bring everything together and build some models! In this last chapter, you will build a base model before tuning some hyperparameters and improving your results with ensembles. You will then get some final tips and tricks to help you compete more efficiently.

Baseline model

Replicate validation score

Baseline based on the date

Baseline based on the gradient boosting

Hyperparameter tuning

Grid search

2D grid search

Model ensembling

Model blending

Model stacking I

Model stacking II

Testing Kaggle forum ideas

Select final submissions

Final thoughts

Начало Главы

Winning a Kaggle Competition in Python

Курс
завершен

Получите свидетельство о достижениях

Добавьте эти данные в свой профиль LinkedIn, резюме или CV.
Поделитесь этим в социальных сетях и в своем отчете об оценке эффективности работы.Запишитесь Прямо Сейчас

Присоединяйтесь 19 миллионов учащихся и начните Winning a Kaggle Competition in Python сегодня!

Создайте бесплатный аккаунт

или

Продолжая, вы принимаете наши Условия использования, нашу Политику конфиденциальности и подтверждаете, что ваши данные хранятся в США.

Развивайте свои навыки работы с данными с помощью DataCamp для мобильных устройств.

Успевайте в обучении на ходу с помощью наших мобильных курсов и ежедневных 5-минутных заданий по программированию.