Skip to main content

Visualizing Big Data with Trelliscope in R

Learn how to visualize big data in R using ggplot2 and trelliscopejs.

Start Course for Free
4 Hours16 Videos46 Exercises4,425 Learners3450 XPBig Data TrackInteractive Data Visualization Track

Create Your Free Account



By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA. You confirm you are at least 16 years old (13 if you are an authorized Classrooms user).

Loved by learners at thousands of companies

Course Description

Having honed your visualization skills by learning ggplot2, it's now time to tackle larger datasets. In this course, you will learn several techniques for visualizing big data, with particular focus on the scalable visualization technique of faceting. You will learn how to put this technique into action using the Trelliscope approach as implemented in the trelliscopejs R package. Trelliscope plugs seamlessly into standard R workflows and produces interactive visualizations that allow you to visually explore your data in detail. By the end of this course, you will be able to easily create interactive exploratory displays of large datasets that will help you and your colleagues gain new insights into your data.

  1. 1

    General strategies for visualizing big data


    Learn different strategies for plotting big data using ggplot2, including calculating and plotting summary statistics, various techniques to deal with overplotting, and principles of small multiples with faceting, which leads into Trelliscope.

    Play Chapter Now
    Visualizing summaries
    50 xp
    Daily ride counts
    100 xp
    Distribution of cab fare amount
    100 xp
    Distribution of payment type
    100 xp
    Adding more detail to summaries
    50 xp
    Relationship between trip duration and total fare
    100 xp
    Faceting daily rides
    100 xp
    Tip amount distribution faceted by payment type
    100 xp
    Visualizing subsets
    50 xp
    Comparing fare distribution by payment type
    100 xp
    Visualizing all subsets
    50 xp
  2. 2

    ggplot2 + TrelliscopeJS

    In the previous chapter you saw how faceting can be used as a powerful technique for visualizing a lot of data that can be naturally partitioned in some meaningful way. Now, using the trelliscopejs package with ggplot2, you will learn how to create faceted visualizations when the number of partitions in the data becomes too large to effectively view in a single screen.

    Play Chapter Now
  3. 3

    Trelliscope in the Tidyverse

    The ggplot2 + trelliscopejs interface is easy to use, but trelliscopejs also provides a faceted plotting mechanism that gives you much more flexibility in what plotting system you use and how to specify cognostics. You will learn all about that in this chapter!

    Play Chapter Now
  4. 4

    Case Study: Exploring Montreal BIXI Bike Data

    The Montreal BIXI bike network provides open data for every bike ride, including the date, time, duration, and start and end stations of the ride. In this chapter, you will analyze data from over 4 million bike rides in 2017, going between 546 stations. There are many interesting exploratory questions to ask from this data and you will create exploratory visualizations ranging from summary statistics to detailed Trelliscope visualizations that will give you interesting insight into the data.

    Play Chapter Now

In the following tracks

Big DataInteractive Data Visualization


richieRichie CottonyashasYashas Roybenjaminfeder-87fafd1d-ffc6-4bd6-9915-1dad3e0f844aBenjamin Feder
Ryan Hafen Headshot

Ryan Hafen

Author of TrelliscopeJS

Ryan Hafen is a statistical consultant and a remote adjunct assistant professor in the Statistics Department at Purdue University. Ryan's research focuses on methodology, tools, and applications in exploratory analysis, statistical model building, and machine learning on large, complex datasets. He is the developer of the datadr and Trelliscope components of the Tessera project, as well as the rbokeh R visualization interface to the Bokeh plotting library. Prior to his work as a statistical consultant, Ryan worked at Pacific Northwest National Laboratory doing applied work on large complex data spanning many domains, including power systems engineering, nuclear forensics, high energy physics, biology, and cyber security.
See More

What do other learners have to say?

I've used other sites—Coursera, Udacity, things like that—but DataCamp's been the one that I've stuck with.

Devon Edwards Joseph
Lloyds Banking Group

DataCamp is the top resource I recommend for learning data science.

Louis Maiden
Harvard Business School

DataCamp is by far my favorite website to learn from.

Ronald Bowers
Decision Science Analytics, USAA