Skip to main content

David Fernández Guillén has completed

Introduction to Importing Data in Python

Start course For Free
3 hr
3,900 XP
Statement of Accomplishment Badge

Loved by learners at thousands of companies


Course Description

As a data scientist, you will need to clean data, wrangle and munge it, visualize it, build predictive models, and interpret these models. Before you can do so, however, you will need to know how to get data into Python. In this course, you'll learn the many ways to import data into Python: from flat files such as .txt and .csv; from files native to other software such as Excel spreadsheets, Stata, SAS, and MATLAB files; and from relational databases such as SQLite and PostgreSQL.
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.
DataCamp for BusinessFor a bespoke solution book a demo.
  1. 1

    Introduction and flat files

    Free

    In this chapter, you'll learn how to import data into Python from all types of flat files, which are a simple and prevalent form of data storage. You've previously learned how to use NumPy and pandas—you will learn how to use these packages to import flat files and customize your imports.

    Play Chapter Now
    Welcome to the course!
    50 xp
    Importing entire text files
    100 xp
    Importing text files line by line
    100 xp
    The importance of flat files in data science
    50 xp
    Pop quiz: what exactly are flat files?
    50 xp
    Why we like flat files and the Zen of Python
    50 xp
    Importing flat files using NumPy
    50 xp
    Using NumPy to import flat files
    100 xp
    Customizing your NumPy import
    100 xp
    Importing different datatypes
    100 xp
    Importing flat files using pandas
    50 xp
    Using pandas to import flat files as DataFrames (1)
    100 xp
    Using pandas to import flat files as DataFrames (2)
    100 xp
    Customizing your pandas import
    100 xp
    Final thoughts on data import
    50 xp
  2. 2

    Importing data from other file types

    You've learned how to import flat files, but there are many other file types you will potentially have to work with as a data scientist. In this chapter, you'll learn how to import data into Python from a wide array of important file types. These include pickled files, Excel spreadsheets, SAS and Stata files, HDF5 files, a file type for storing large quantities of numerical data, and MATLAB files.

    Play Chapter Now
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.

datasets

Chinook (SQLite)LIGO (HDF5)Battledeath (XLSX)Extent of infectious diseases (DTA)Gene expressions (MATLAB)MNISTSales (SAS7BDAT)SeaslugsTitanic

collaborators

Collaborator's avatar
Francisco Castro

prerequisites

Intermediate Python
Hugo Bowne-Anderson HeadshotHugo Bowne-Anderson

Data Scientist

Hugo is a data scientist, educator, writer and podcaster formerly at DataCamp. His main interests are promoting data & AI literacy, helping to spread data skills through organizations and society and doing amateur stand up comedy in NYC. If you want to know what he likes to talk about, definitely check out DataFramed, the DataCamp podcast, which he hosted and produced.
See More

Join over 18 million learners and start Introduction to Importing Data in Python today!

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.