Skip to main content
Grzegorz Gąsiewski avatar

Grzegorz Gąsiewski has completed

Streamlined Data Ingestion with pandas

Start course For Free
4 hours
4,500 XP
Statement of Accomplishment Badge

Loved by learners at thousands of companies


Course Description

Before you can analyze data, you first have to acquire it. This course teaches you how to build pipelines to import data kept in common storage formats. You’ll use pandas, a major Python library for analytics, to get data from a variety of sources, from spreadsheets of survey responses, to a database of public service requests, to an API for a popular review site. Along the way, you’ll learn how to fine-tune imports to get only what you need and to address issues like incorrect data types. Finally, you’ll assemble a custom dataset from a mix of sources.
For Business

GroupTraining 2 or more people?

Get your team access to the full DataCamp library, with centralized reporting, assignments, projects and more
Try DataCamp for BusinessFor a bespoke solution book a demo.
  1. 1

    Importing Data from Flat Files

    Free

    Practice using pandas to get just the data you want from flat files, learn how to wrangle data types and handle errors, and look into some U.S. tax data along the way.

    Play Chapter Now
    Introduction to flat files
    50 xp
    Get data from CSVs
    100 xp
    Get data from other flat files
    100 xp
    Modifying flat file imports
    50 xp
    Import a subset of columns
    100 xp
    Import a file in chunks
    100 xp
    Handling errors and missing data
    50 xp
    Specify data types
    100 xp
    Set custom NA values
    100 xp
    Skip bad data
    100 xp
  2. 4

    Importing JSON Data and Working with APIs

    Learn how to work with JSON data and web APIs by exploring a public dataset and getting cafe recommendations from Yelp. End by learning some techniques to combine datasets once they have been loaded into data frames.

    Play Chapter Now
For Business

GroupTraining 2 or more people?

Get your team access to the full DataCamp library, with centralized reporting, assignments, projects and more

datasets

Vermont tax return data by ZIP codeFreeCodeCamp New Developer Survey response subsetNYC weather and 311 housing complaints

collaborators

Collaborator's avatar
Adrián Soto
Collaborator's avatar
Hillary Green-Lerman
Amany Mahfouz HeadshotAmany Mahfouz

Data scientist via spatial analytics and geography.

See More

Join over 14 million learners and start Streamlined Data Ingestion with pandas today!

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.