
Feature Engineering for Machine Learning in Python

Create new features to improve the performance of your machine learning models.

4 hours, 16 videos, 53 exercises
28,492 learners, Statement of Accomplishment

Course Description

Every day you read about breakthroughs in how the newest applications of machine learning are changing the world. This reporting often glosses over the large amount of data munging and feature engineering that must be done before any of these models can be used. In this course, you will learn how to do just that. You will work with the Stack Overflow Developer Survey and historic US presidential inauguration addresses to understand how best to preprocess and engineer features from categorical, continuous, and unstructured data. This course will give you hands-on experience in preparing data for your own machine learning models.
In the following tracks

Machine Learning Scientist with Python

  1. Creating Features

    Free

    In this chapter, you will explore what feature engineering is and how to start applying it to real-world data. You will load, explore, and visualize a survey response dataset, and in doing so you will learn about its underlying data types and why they influence how you should engineer your features. Using the pandas package, you will create new features from both categorical and continuous columns.

    Why generate features? (50 xp)
    Getting to know your data (100 xp)
    Selecting specific data types (100 xp)
    Dealing with categorical features (50 xp)
    One-hot encoding and dummy variables (100 xp)
    Dealing with uncommon categories (100 xp)
    Numeric variables (50 xp)
    Binarizing columns (100 xp)
    Binning values (100 xp)
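
The topics listed for this chapter can be sketched in a few lines of pandas. This is a minimal illustration, not the course's own exercise code: the DataFrame, its column names (`Country`, `Salary`), the rarity threshold, and the bin edges are all invented for the example.

```python
import numpy as np
import pandas as pd

# Hypothetical survey-style data; names and values are invented for illustration.
df = pd.DataFrame({
    "Country": ["USA", "India", "USA", "France", "India", "Chile"],
    "Salary": [0, 30000, 85000, 52000, 0, 120000],
})

# Selecting specific data types: split columns into numeric and non-numeric.
numeric_cols = df.select_dtypes(include="number").columns.tolist()
categorical_cols = df.select_dtypes(exclude="number").columns.tolist()

# Uncommon categories: group values seen fewer than 2 times into "Other"
# (the threshold of 2 is an arbitrary choice for this tiny example).
counts = df["Country"].value_counts()
rare = counts[counts < 2].index
df["Country_grouped"] = df["Country"].where(~df["Country"].isin(rare), "Other")

# One-hot encoding keeps one column per category.
one_hot = pd.get_dummies(df["Country_grouped"], prefix="Country")

# Dummy variables drop the first category, which is implied by the others.
dummies = pd.get_dummies(df["Country_grouped"], prefix="Country", drop_first=True)

# Binarizing: flag whether a respondent reports any salary at all.
df["Paid_Job"] = (df["Salary"] > 0).astype(int)

# Binning: bucket a continuous column into labelled ranges (right-inclusive).
df["Salary_binned"] = pd.cut(
    df["Salary"],
    bins=[-np.inf, 40000, 100000, np.inf],
    labels=["low", "medium", "high"],
)

print(df)
print(one_hot)
```

One-hot encoding produces a column for every category, while the dummy-variable form has one fewer column because the dropped category is implied when all the others are zero.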
  2. Dealing with Messy Data

    This chapter introduces you to the reality of messy and incomplete data. You will learn how to find where your data has missing values and explore several approaches for dealing with them. You will also use string manipulation techniques to remove unwanted characters from your dataset.
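
As a rough sketch of what this can look like in pandas, the example below locates missing values, compares dropping rows with filling them, and strips unwanted characters from a text column before converting it to numbers. The data and column names (`Gender`, `YearsExperience`, `RawSalary`) are assumptions made up for the illustration, and the fill strategies shown are only two of several reasonable options.

```python
import numpy as np
import pandas as pd

# Hypothetical messy data; names and values are invented for illustration.
df = pd.DataFrame({
    "Gender": ["Male", None, "Female", "Male"],
    "YearsExperience": [3.0, np.nan, 10.0, 5.0],
    "RawSalary": ["$50,000.00", "£30,500", None, "72000"],
})

# Find where values are missing: one count per column.
missing_counts = df.isna().sum()

# Approach 1: drop every row that has at least one missing value.
complete_rows = df.dropna()

# Approach 2: fill the gaps instead of discarding rows.
# A placeholder label for a categorical column, the median for a numeric one.
df["Gender"] = df["Gender"].fillna("Not Given")
df["YearsExperience"] = df["YearsExperience"].fillna(df["YearsExperience"].median())

# String manipulation: remove currency symbols and thousands separators,
# then convert to numbers. Anything still unparseable becomes NaN.
cleaned = df["RawSalary"].str.replace(r"[$£,]", "", regex=True)
df["Salary"] = pd.to_numeric(cleaned, errors="coerce")

print(missing_counts)
print(df)
```

Dropping rows is simple but discards data, here two of the four rows. Filling keeps every row at the cost of introducing values that were never observed.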

  4. Dealing with Text Data

    Finally, in this chapter, you will work with unstructured text data and learn ways to engineer columnar features from a text corpus. You will compare how different approaches affect how much context is extracted from a text, and how to balance the need for context against creating too many features.
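
The trade-off between context and feature count can be illustrated with a hand-rolled bag-of-n-grams counter. This is a sketch under stated assumptions, not the course's method: the three-sentence corpus is invented, and `tokenize`, `ngrams`, and `count_matrix` are hypothetical helpers written for this example. In practice a library vectorizer such as scikit-learn's `CountVectorizer` would usually be used instead.

```python
import re
from collections import Counter

import pandas as pd

# Hypothetical mini corpus, invented for illustration.
corpus = [
    "We the people of the United States",
    "The people of this nation",
    "Fellow citizens of the United States",
]


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def ngrams(tokens, n):
    """Return all runs of n consecutive tokens, joined by spaces."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def count_matrix(docs, n_values=(1,), max_features=None):
    """Build a document-by-term count table for the requested n-gram sizes."""
    per_doc = []
    for doc in docs:
        tokens = tokenize(doc)
        per_doc.append(Counter(g for n in n_values for g in ngrams(tokens, n)))

    totals = Counter()
    for doc_counts in per_doc:
        totals.update(doc_counts)

    # Rank terms by corpus frequency (ties broken alphabetically) and,
    # if requested, keep only the top max_features to limit table width.
    ranked = sorted(totals, key=lambda term: (-totals[term], term))
    vocab = sorted(ranked[:max_features]) if max_features else sorted(ranked)
    return pd.DataFrame([[c[t] for t in vocab] for c in per_doc], columns=vocab)


unigrams = count_matrix(corpus, n_values=(1,))
with_bigrams = count_matrix(corpus, n_values=(1, 2))
capped = count_matrix(corpus, n_values=(1, 2), max_features=5)

print(unigrams.shape, with_bigrams.shape, capped.shape)
print(capped)
```

Adding bigrams captures word order, so "united states" becomes a feature of its own, but it doubles the number of columns even on this tiny corpus. Capping the vocabulary is one way to keep the table manageable.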

Datasets

Stack Overflow Survey Responses (Modified)
US Presidential Inauguration Addresses

Collaborators

Sumedh Panchadhar
Hillary Green-Lerman
Robert O'Callaghan (Director of Data Science, Ordergroove)
