Ir al contenido principal

Curso

Machine learning con modelos basados en árboles en Python

IntermedioNivel de habilidad

Actualizado 12/2025

En este curso, aprenderás a utilizar modelos basados en árboles y conjuntos para regresión y clasificación utilizando scikit-learn.

Comienza El Curso Gratis

PythonMachine Learning5 h15 vídeos57 Ejercicios4,650 XP110K+Certificado de logros

Crea Tu Cuenta Gratuita

o

Al continuar, aceptas nuestros Términos de uso, nuestra Política de privacidad y que tus datos se almacenen en los EE. UU.

Preferido por estudiantes en miles de empresas

¿Formar a 2 o más personas?

Probar DataCamp for Business

Descripción del curso

Los árboles de decisión son modelos de aprendizaje supervisado que se utilizan para problemas de clasificación y regresión. Los modelos de árbol presentan una gran flexibilidad que tiene un precio: por un lado, los árboles son capaces de captar relaciones no lineales complejas; por otro, son propensos a memorizar el ruido presente en un conjunto de datos. Al agregar las predicciones de árboles entrenados de forma diferente, los métodos de conjunto aprovechan la flexibilidad de los árboles, al tiempo que reducen su tendencia a memorizar el ruido. Los métodos de ensamblaje se utilizan en diversos campos y tienen un historial probado de victorias en muchas competiciones de machine learning. En este curso, aprenderás a utilizar Python para entrenar árboles de decisión y modelos basados en árboles con la sencilla biblioteca de machine learning scikit-learn. Comprenderás las ventajas e inconvenientes de los árboles y demostrarás cómo el ensamblaje puede paliar estos inconvenientes, todo ello mientras practicas con conjuntos de datos del mundo real. Por último, también comprenderás cómo afinar los hiperparámetros más influyentes para sacar el máximo partido a tus modelos.

Requisitos previos

Supervised Learning with scikit-learn

1

Classification and Regression Trees

Classification and Regression Trees (CART) are a set of supervised learning models used for problems involving classification and regression. In this chapter, you'll be introduced to the CART algorithm.

Decision tree for classification

Train your first classification tree

Evaluate the classification tree

Logistic regression vs classification tree

Classification tree Learning

Growing a classification tree

Using entropy as a criterion

Entropy vs Gini index

Decision tree for regression

Train your first regression tree

Evaluate the regression tree

Linear regression vs regression tree

Iniciar Capítulo

2

The Bias-Variance Tradeoff

The bias-variance tradeoff is one of the fundamental concepts in supervised machine learning. In this chapter, you'll understand how to diagnose the problems of overfitting and underfitting. You'll also be introduced to the concept of ensembling where the predictions of several models are aggregated to produce predictions that are more robust.

Generalization Error

Complexity, bias and variance

Overfitting and underfitting

Diagnose bias and variance problems

Instantiate the model

Evaluate the 10-fold CV error

Evaluate the training error

High bias or high variance?

Ensemble Learning

Define the ensemble

Evaluate individual classifiers

Better performance with a Voting Classifier

Iniciar Capítulo

3

Bagging and Random Forests

Bagging is an ensemble method involving training the same algorithm many times using different subsets sampled from the training data. In this chapter, you'll understand how bagging can be used to create a tree ensemble. You'll also learn how the random forests algorithm can lead to further ensemble diversity through randomization at the level of each split in the trees forming the ensemble.

Define the bagging classifier

Evaluate Bagging performance

Out of Bag Evaluation

Prepare the ground

OOB Score vs Test Set Score

Random Forests (RF)

Train an RF regressor

Evaluate the RF regressor

Visualizing features importances

Iniciar Capítulo

4

Boosting

Boosting refers to an ensemble method in which several models are trained sequentially with each model learning from the errors of its predecessors. In this chapter, you'll be introduced to the two boosting methods of AdaBoost and Gradient Boosting.

Define the AdaBoost classifier

Train the AdaBoost classifier

Evaluate the AdaBoost classifier

Gradient Boosting (GB)

Define the GB regressor

Train the GB regressor

Evaluate the GB regressor

Stochastic Gradient Boosting (SGB)

Regression with SGB

Train the SGB regressor

Evaluate the SGB regressor

Iniciar Capítulo

5

Model Tuning

The hyperparameters of a machine learning model are parameters that are not learned from data. They should be set prior to fitting the model to the training set. In this chapter, you'll learn how to tune the hyperparameters of a tree-based model using grid search cross validation.

Tuning a CART's Hyperparameters

Tree hyperparameters

Set the tree's hyperparameter grid

Search for the optimal tree

Evaluate the optimal tree

Tuning a RF's Hyperparameters

Random forests hyperparameters

Set the hyperparameter grid of RF

Search for the optimal forest

Evaluate the optimal forest

Congratulations!

Iniciar Capítulo

Machine learning con modelos basados en árboles en Python

Curso
completo

Obtener certificado de logros

Añade esta certificación a tu perfil de LinkedIn o a tu currículum.
Compártelo en redes sociales y en tu evaluación de desempeño.Inscríbete Ahora

¡Únete a 19 millones de estudiantes y empieza Machine learning con modelos basados en árboles en Python hoy mismo!

Crea Tu Cuenta Gratuita

o

Al continuar, aceptas nuestros Términos de uso, nuestra Política de privacidad y que tus datos se almacenen en los EE. UU.

Desarrolla tus habilidades de datos con la aplicación móvil de DataCamp

Progresa desde cualquier dispositivo móvil con nuestros cursos y desafíos de programación diarios de 5 minutos.