Premium project

Reducing Traffic Mortality in the USA

How can we find a good strategy for reducing traffic-related deaths?

Start Project
12 Tasks1,500 XP

Loved by learners at thousands of companies


Project Description

While the rate of fatal road accidents has been decreasing steadily since the 80s, the past ten years have seen a stagnation in this reduction. Coupled with the increase in number of miles driven in the nation, the total number of traffic related-fatalities has now reached a ten year high and is rapidly increasing. By looking at the demographics of traffic accident victims for each US state, we find that there is a lot of variation between states. Now we want to understand if there are patterns in this variation in order to derive suggestions for a policy action plan. In particular, instead of implementing a costly nation-wide plan we want to focus on groups of states with similar profiles. How can we find such groups in a statistically sound way and communicate the result effectively?

Project Tasks

  1. 1
    The raw data files and their format
  2. 2
    Read in and get an overview of the data
  3. 3
    Create a textual and a graphical summary of the data
  4. 4
    Quantify the association of features and accidents
  5. 5
    Fit a multivariate linear regression
  6. 6
    Perform PCA on standardized data
  7. 7
    Visualize the first two principal components
  8. 8
    Find clusters of similar states in the data
  9. 9
    KMeans to visualize clusters in the PCA scatter plot
  10. 10
    Visualize the feature differences between the clusters
  11. 11
    Compute the number of accidents within each cluster
  12. 12
    Make a decision when there is no clear right choice
Technologies
R R
Topics
Data ManipulationData VisualizationMachine LearningImporting & Cleaning Data
Joel Östblom Headshot

Joel Östblom

PhD Candidate at University of Toronto
Joel is a PhD student in Biomedical Engineering at the University of Toronto, where he uses computational and experimental approaches to better understand fundamental stem cell decisions. Outside school, he enjoys playing ice hockey, eating and making food, being in nature, and figuring out how he can maximize the time he spends inside vim.
See More

What do other learners have to say?

I've used other sites—Coursera, Udacity, things like that—but DataCamp's been the one that I've stuck with.

Devon Edwards Joseph
Lloyds Banking Group

DataCamp is the top resource I recommend for learning data science.

Louis Maiden
Harvard Business School

DataCamp is by far my favorite website to learn from.

Ronald Bowers
Decision Science Analytics, USAA