Skip to main content
HomeBlogData Literacy

Data Demystified: Data Visualizations that Capture Relationships

In part nine of data demystified, we’ll dive deep into the world of data visualization, continuing with visualizations that capture relationships.
Sep 2022  · 5 min read

Welcome to part nine of our month-long data demystified series. As part of Data Literacy Month, this series will clarify key concepts from the world of data, answer the questions you may be too afraid to ask and have fun along the way. If you want to start at the beginning, read our first entry in the series: What is a Dataset?

Data Demystified: Data Visualizations that Capture relationships banner

This week, we’ll cover common data visualizations and how to interpret them. Data visualization is often called the gateway drug into data science; this blog post will look at data visualizations that capture relationships and how to interpret them.

Visualizations that Visualize Relationships

Capturing relationships between different data points is a crucial application of data visualization. This section will look at the most popular visualizations that capture relationships. Most notably, bar charts, scatter plots, bubble charts, and more.

Bar Charts

A bar chart compares the value of multiple categories. It consists of rectangles whose lengths are proportional to the value of each category. Bar charts are prevalent because they are easy to read. Businesses commonly use bar charts to make comparisons, like comparing the market share of different brands or the revenue of different regions. 

There are multiple types of bar charts, each suited for a different purpose.

vertical bar chart

In a vertical bar chart, also known as a column chart, the horizontal axis represents the categories while the vertical axis contains the value. 

horizontal bar chart

In a horizontal bar chart, the categories are placed on the vertical axis. It is a better alternative than a column chart when the names of the categories are long. 

clustered bar chart

A clustered bar chart is a vertical or horizontal bar chart that displays more than one data series in one category. It is useful for showing change over time or comparing multiple data series within a category. 

A rule of thumb when creating a bar chart is to start the axis or baseline at zero. Else, you run the risk of exaggerating minor differences between categories. Another tip is to order the bar charts in a meaningful order. Here are some examples of effective bar charts.

three decades of top 5 Madonna singles bar chart

A vertical bar chart. Notice how the baseline starts at zero and that the categories are ranked by decade. (Source: BBC)

madonna richest US female artist horizontal bar chart

A horizontal bar chart comparing the wealth of US female artists. (Source: BBC)

wealth of billionaires has increased clustered horizontal bar chart

A clustered horizontal chart that compares multiple categories in 2019 and 2020 (Source: BBC)

Scatter plot

A scatterplot shows the relationship between two variables by plotting multiple data points. It is often used in data exploration to understand the data and quickly surface potential correlations. 

cost per mile by miles driven scatter plot

A scatter plot comparing cost per mile and miles driven per month (Source: Storytelling with Data)

Here are some practical concepts to consider for interpreting a scatter plot.

  1. Understand the axes: What are the different measures and units visualized on each axis?
  2. The relationship between the variables: Is there an increasing or decreasing relationship? Is the relationship best described with a straight line or an exponential curve?
  3. Identify any outliers: Are there any data points that do not agree with the rest of the graph? Can you explain why the outlier behaves as such?

video games scatter plot

The Economist shows that video games are good for mental health with a scatter plot (Source: The Economist)

Connected scatterplot

A connected scatterplot is a scatterplot with connected data points. It is helpful for visualizing the evolution of two variables with time. 

military budget connected scatterplot

A connected scatterplot showing the relationship between the number of troops and military budget over time (Source: Steve Haroz)

The above scatterplot depicts the relationship between military budget and the number of troops over time. It uses arrows to show the evolution over time. Reading a connected scatter plot might be counterintuitive to some. Thus, helpful annotations are critical to a well-crafted connected scatterplot.

Evolution of car safety connected scatterplot

Using a connected scatterplot to depict the evolution of car safety (Source: New York Times)

Bubble chart

A bubble chart is an extension of the scatterplot that shows the relationship between three variables using the x-axis, y-axis, and the size of the bubble. You can also include a fourth dimension variable in a bubble chart by giving each data point colors. 

competitive landscape bubble charts b&w

competitive bubble charts in color

Bubble charts and colored bubble charts in action (Source: Storytelling with Data)

Anyone using bubble charts should note that a bubble chart can overwhelm the reader. Thus, it is helpful to complement bubble charts with interactive narration. A great example of this comes from Hans Rosling’s bubble chart, depicting the relationship between life expectancy, GDP per capita, and population size. 

GapMinder Lifespan bubble chart

The Gapminder bubble chart is one of the most notable uses of bubble charts

Get Started with Data Visualization Today

We hope you enjoyed this short introduction to data visualization. In the next series entry, we’ll look at data visualizations that capture distributions. If you want to get started on your data visualization learning journey today, check out the following resources. 

Topics

Data Visualizations courses

Course

Understanding Data Visualization

2 hr
178.2K
An introduction to data visualization with no coding involved.
See DetailsRight Arrow
Start Course
See MoreRight Arrow
Related
image5.png

blog

Data Demystified: Data Visualizations that Capture Trends

In part eight of data demystified, we’ll dive deep into the world of data visualization, starting off with visualizations that capture trends.
Richie Cotton's photo

Richie Cotton

10 min

image1.png

blog

Data Demystified: Data Visualizations that Capture Distributions

In part 10 of data demystified, we’ll dive deep into the world of data visualization, continuing with visualizations that capture distributions.
Richie Cotton's photo

Richie Cotton

8 min

Correlation vs. Causation

blog

Data Demystified: Correlation vs. Causation

In part six of data demystified, we’ll break down one of the biggest fallacies in data: Correlation does not imply causation
Richie Cotton's photo

Richie Cotton

6 min

The Four Types of Analytics

blog

Data Demystified: The Four Types of Analytics

In the fourth entry of data demystified, we’ll break down the four types of analytics. From a descriptive to prescriptive, we’ll look at how analytics can answer the most pressing questions.
Richie Cotton's photo

Richie Cotton

5 min

An Overview of Descriptive Statistics

blog

Data Demystified: An Overview of Descriptive Statistics

In the fifth entry of data demystified, we provide an overview of the basics of descriptive statistics, one of the fundamental areas of data science.
Richie Cotton's photo

Richie Cotton

6 min

cheat sheet

Data Visualization Cheat Sheet

In this data visualization cheat sheet, you'll learn about the most common data visualizations to employ, when to use them, and their most common use-cases.
Richie Cotton's photo

Richie Cotton

5 min

See MoreSee More