Importing Data Into R

Course Description

Importing data into R should be the easiest step of an analysis. Unfortunately, that is almost never the case. Data comes in all sorts of formats, ranging from flat files and statistical software files to databases and web data. Knowing which approach to use for each format is key to getting started with the actual analysis. In this course, you will learn the basics of loading data into R so you can get up and running in no time!
  1

    Importing data from flat files


    Lots of data comes in the form of flat files: simple tabular text files. Learn how to import all common formats of flat file data with base R functions and the dedicated readr and data.table packages.

    Introduction & Flat Files
    Any changes?
    Column classes
    readr & data.table
    col_types, skip and n_max
    col_types with collectors
    fread: more advanced use
    Dedicated classes
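
As a rough illustration of the flat-file imports this chapter covers, the sketch below writes a tiny CSV to a temporary file and reads it back with base R. The file contents and the variable names (`csv_path`, `scores`) are made up for the example. The readr and data.table calls only run if those packages happen to be installed.

```r
# Write a small CSV to a temporary file so the example is self-contained.
# The data below is invented purely for illustration.
csv_path <- tempfile(fileext = ".csv")
writeLines(c("name,score,passed",
             "Ann,9.5,TRUE",
             "Bob,7.0,FALSE"), csv_path)

# Base R: read.csv() with explicit column classes.
scores <- read.csv(csv_path,
                   stringsAsFactors = FALSE,
                   colClasses = c("character", "numeric", "logical"))
str(scores)

# readr: col_types takes a compact string (c = character, d = double,
# l = logical); n_max limits the number of rows read.
if (requireNamespace("readr", quietly = TRUE)) {
  first_row <- readr::read_csv(csv_path, col_types = "cdl", n_max = 1)
  print(first_row)
}

# data.table: fread() guesses the separator and can select columns by name.
if (requireNamespace("data.table", quietly = TRUE)) {
  name_score <- data.table::fread(csv_path, select = c("name", "score"))
  print(name_score)
}
```

Setting the column classes up front avoids surprises from type guessing, which is one reason the chapter spends time on `colClasses` and `col_types`.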
  2

    Importing data from Excel


    Excel is a very widely used data analysis tool. If you prefer to do your analyses in R, though, you'll need to know how to import Excel data into R. This chapter shows you how to use readxl and gdata to do so. The XLConnect package, which takes all of this one step further, is also discussed.
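
A minimal sketch of an Excel import with readxl, assuming the package is installed. It uses the example workbook that ships with readxl rather than a file from the course, and the variable names are chosen for illustration only.

```r
# Runs only if readxl is available; otherwise the block is skipped.
if (requireNamespace("readxl", quietly = TRUE)) {
  # Path to an example workbook bundled with the readxl package.
  xlsx_path <- readxl::readxl_example("datasets.xlsx")

  # List the sheet names before deciding which one to import.
  sheets <- readxl::excel_sheets(xlsx_path)
  print(sheets)

  # Import the first sheet; the result is a tibble (a data frame).
  first_sheet <- readxl::read_excel(xlsx_path, sheet = sheets[1])
  print(head(first_sheet))
}
```

Listing the sheets first is a useful habit, since a workbook often holds several tables and `read_excel()` imports only one sheet per call.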

  4

    Importing data from relational databases


    Many companies store their information in relational databases. The R community has developed packages to get data out of these systems. You'll learn how to connect to a database, how to retrieve data from it, and how to make things more efficient by performing part of your computations on the database side.
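
The connect, query, and disconnect workflow described here might look roughly like the following. This is a sketch under assumptions: it uses DBI with an in-memory SQLite database (via RSQLite) as a stand-in for a real company database, and the `employees` table and its contents are invented for the example.

```r
# Runs only if both DBI and RSQLite are available.
if (requireNamespace("DBI", quietly = TRUE) &&
    requireNamespace("RSQLite", quietly = TRUE)) {
  # Connect to a throwaway in-memory database.
  con <- DBI::dbConnect(RSQLite::SQLite(), ":memory:")

  # Create a small illustrative table to query.
  employees <- data.frame(
    id     = 1:3,
    name   = c("Ann", "Bob", "Cem"),
    salary = c(62000, 48000, 55000)
  )
  DBI::dbWriteTable(con, "employees", employees)
  print(DBI::dbListTables(con))

  # Filter on the database side: only matching rows travel back to R.
  high_paid <- DBI::dbGetQuery(
    con,
    "SELECT name, salary FROM employees WHERE salary > 50000"
  )
  print(high_paid)

  # Always close the connection when done.
  DBI::dbDisconnect(con)
}
```

Putting the `WHERE` clause in the query, rather than importing the whole table and subsetting in R, is the kind of database-side computation the chapter refers to.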

  5

    Importing data from the web


    More and more of the information that data scientists use resides on the web. Importing this data into R requires an understanding of the protocols and typical data formats used on the web. In this chapter, you'll get a crash course in HTTP, learn to perform your own HTTP requests from inside R, and get to know a popular web data format: JSON.
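
To give a feel for the HTTP and JSON steps, here is a hedged sketch using the httr and jsonlite packages, which are an assumption of this example rather than something the description names. The helper `fetch_json()` is hypothetical and is defined but not called, so the example needs no network access. The JSON string is invented for illustration.

```r
# Hypothetical helper (not called here): perform an HTTP GET request
# and parse the JSON body of the response.
fetch_json <- function(url) {
  response <- httr::GET(url)
  httr::stop_for_status(response)  # raise an error on 4xx/5xx status codes
  body <- httr::content(response, as = "text", encoding = "UTF-8")
  jsonlite::fromJSON(body)
}

# Parse a literal JSON string; runs only if jsonlite is available.
if (requireNamespace("jsonlite", quietly = TRUE)) {
  json_text <- '[{"id": 1, "lang": "R"}, {"id": 2, "lang": "Python"}]'

  # An array of objects with the same keys is simplified to a data frame.
  langs <- jsonlite::fromJSON(json_text)
  str(langs)
}
```

Separating the request from the parsing keeps each step easy to inspect: the status code and raw text can be checked before any JSON conversion happens.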

Filip Schouwenaars

Data Science Instructor at DataCamp

Filip is the passionate developer behind several of DataCamp's most popular Python, SQL, and R courses. Filip led the development of DataLab, a collaborative data science notebook. Under the motto 'Eat your own dog food', he uses DataLab to understand how users learn on and interact with DataCamp. Filip holds degrees in Electrical Engineering and Artificial Intelligence.
