Course

Visualizing Big Data with Trelliscope in R

Skill Level: Basic

4.7+

Updated 08/2024

Learn how to visualize big data in R using ggplot2 and trelliscopejs.

R, Data Visualization

4 hr

16 videos

46 Exercises

3,450 XP

6,282

Statement of Accomplishment

Course Description

Once you have honed your visualization skills with ggplot2, it is time to tackle larger datasets. In this course, you will learn several techniques for visualizing big data, with a particular focus on faceting, a visualization technique that scales to large datasets. You will put this technique into action using the Trelliscope approach as implemented in the trelliscopejs R package. Trelliscope plugs into standard R workflows and produces interactive visualizations that let you explore your data in detail. By the end of this course, you will be able to create interactive exploratory displays of large datasets that help you and your colleagues gain new insights into your data.

Prerequisites

Introduction to the Tidyverse

1

General strategies for visualizing big data

Learn different strategies for plotting big data using ggplot2, including calculating and plotting summary statistics, various techniques to deal with overplotting, and principles of small multiples with faceting, which leads into Trelliscope.

Visualizing summaries

Daily ride counts

Distribution of cab fare amount

Distribution of payment type

Adding more detail to summaries

Relationship between trip duration and total fare

Faceting daily rides

Tip amount distribution faceted by payment type

Visualizing subsets

Comparing fare distribution by payment type

Visualizing all subsets
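The three strategies named in this chapter (plotting summaries, binning to reduce overplotting, and faceting into small multiples) can be sketched with dplyr and ggplot2. This is a minimal illustration, not course material: the `trips` table and its columns are simulated, hypothetical stand-ins for a large cab-ride dataset.

```r
library(dplyr)
library(ggplot2)

# Simulated stand-in for a large trips table (all columns are hypothetical)
set.seed(1)
n <- 100000
trips <- tibble(
  pickup_date  = as.Date("2024-01-01") + sample(0:29, n, replace = TRUE),
  payment_type = sample(c("Card", "Cash"), n, replace = TRUE, prob = c(0.7, 0.3)),
  duration_min = rexp(n, rate = 1 / 12),
  total_fare   = 3 + 0.9 * duration_min + rnorm(n, sd = 2)
)

# Strategy 1: summarise first, then plot the much smaller summary table
daily_counts <- count(trips, pickup_date, name = "n_rides")
p_daily <- ggplot(daily_counts, aes(pickup_date, n_rides)) +
  geom_line()

# Strategy 2: replace an overplotted scatterplot with 2D bins
p_binned <- ggplot(trips, aes(duration_min, total_fare)) +
  geom_bin2d(bins = 50)

# Strategy 3: small multiples, one panel per payment type
p_facet <- p_binned + facet_wrap(~ payment_type)
```

Printing `p_daily`, `p_binned`, or `p_facet` renders the corresponding plot. Summarising before plotting keeps the plotted data small regardless of how many rows the raw table has.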

2

ggplot2 + TrelliscopeJS

In the previous chapter, you saw that faceting is a powerful technique for visualizing large amounts of data that can be naturally partitioned in some meaningful way. Now, using the trelliscopejs package with ggplot2, you will learn how to create faceted visualizations when the number of partitions becomes too large to view effectively on a single screen.

Faceting with TrelliscopeJS

Trelliscope faceting gapminder by country

Interacting with the TrelliscopeJS displays

Interacting with the display

Additional TrelliscopeJS features

Customizing the gapminder display

Examining the new cognostics

Adding your own cognostics

Adding custom cognostics

Interpreting custom cognostics

3

Trelliscope in the Tidyverse

The ggplot2 + trelliscopejs interface is easy to use, but trelliscopejs also provides a faceted plotting mechanism that gives you much more flexibility in what plotting system you use and how to specify cognostics. You will learn all about that in this chapter!

Trelliscope in the tidyverse

Grouping and nesting

Stock price display

Exploring the display

Adding cognostics

Cognostics from nested data frames

Navigating stock plots with new cognostics

Trelliscope options

Customizing the stock display

Visualizing databases of images

Visualizing Pokemon

The most powerful Pokemon
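The grouping-and-nesting workflow this chapter builds on can be sketched with dplyr, tidyr, and purrr alone. The example below does not call trelliscopejs. It only shows how one row per group with per-group summary columns might be prepared, which is the kind of value that could serve as a cognostic. The `stocks` data and the column names `mean_close` and `pct_change` are simulated and hypothetical.

```r
library(dplyr)
library(tidyr)
library(purrr)

# Simulated daily closing prices for three made-up symbols
set.seed(42)
stocks <- tibble(
  symbol = rep(c("AAA", "BBB", "CCC"), each = 50),
  day    = rep(1:50, times = 3)
) %>%
  group_by(symbol) %>%
  mutate(close = 100 + cumsum(rnorm(n()))) %>%
  ungroup()

# One row per symbol; the remaining columns are nested into a list column `data`
by_symbol <- stocks %>%
  group_by(symbol) %>%
  nest() %>%
  ungroup() %>%
  mutate(
    # Per-group summaries computed from each nested data frame
    mean_close = map_dbl(data, ~ mean(.x$close)),
    pct_change = map_dbl(
      data,
      ~ 100 * (last(.x$close) - first(.x$close)) / first(.x$close)
    )
  )
```

Each row of `by_symbol` now corresponds to one would-be panel, with its data and its summary metrics side by side, so panels can be sorted or filtered on those metrics.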

4

Case Study: Exploring Montreal BIXI Bike Data

The Montreal BIXI bike network provides open data for every bike ride, including the date, time, duration, and start and end stations. In this chapter, you will analyze data from over 4 million bike rides taken in 2017 between 546 stations. The data raises many interesting exploratory questions, and you will create exploratory visualizations, ranging from summary statistics to detailed Trelliscope visualizations, that give you insight into the data.

Montreal BIXI bike data

Number of daily rides

Examining time-of-day

Effect of membership and weekday

Summary visualization recap

Daily plots

Looking at all days

Top 100 routes dataset

Augmenting the data: Route summary statistics

Visualizing the data: Counts by hour-of-day

Evaluating the visualization

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review

Don’t just take our word for it

4.7 from 40 reviews

Rating breakdown: 75%, 25%, 0%, 0%, 0%

FAQs

What is Trelliscope and how does it differ from standard ggplot2 faceting?

Trelliscope extends ggplot2 faceting to handle datasets with too many partitions for a single screen. It creates interactive displays you can sort, filter, and explore in detail.

What real dataset is used in the case study chapter?

Chapter 4 uses Montreal BIXI bike network data from 2017, covering over 4 million rides across 546 stations. You will create exploratory visualizations from summary statistics to detailed Trelliscope displays.

What are the prerequisites for this course?

The only prerequisite is Introduction to the Tidyverse. If you are comfortable with basic tidyverse workflows and ggplot2, you are ready to start.

Does the course cover strategies for dealing with overplotting in large datasets?

Yes, Chapter 1 covers techniques for handling overplotting, calculating and plotting summary statistics, and the principles of small multiples before introducing Trelliscope.

What are cognostics in the context of Trelliscope?

Cognostics are summary statistics or metrics associated with each panel in a Trelliscope display. Chapter 3 teaches you how to specify custom cognostics for more flexible faceted visualizations.
