PhD Candidate at University of Toronto
Reducing Traffic Mortality in the USA
How can we find a good strategy for reducing traffic-related deaths?
Project Description
Although the rate of fatal road accidents has been decreasing steadily since the 1980s, this decline has stagnated over the past ten years. Coupled with the increase in the number of miles driven in the nation, the total number of traffic-related fatalities has now reached a ten-year high and is rapidly increasing.
By looking at the demographics of traffic accident victims for each US state, we find that there is a lot of variation between states. Now we want to understand whether there are patterns in this variation in order to derive suggestions for a policy action plan. In particular, instead of implementing a costly nationwide plan, we want to focus on groups of states with similar profiles. How can we find such groups in a statistically sound way and communicate the result effectively?
Project Tasks
1. The raw data files and their format
2. Read in and get an overview of the data
3. Create a textual and a graphical summary of the data
4. Quantify the association of features and accidents
5. Fit a multivariate linear regression
6. Perform PCA on standardized data
7. Visualize the first two principal components
8. Find clusters of similar states in the data
9. KMeans to visualize clusters in the PCA scatter plot
10. Visualize the feature differences between the clusters
11. Compute the number of accidents within each cluster
12. Make a decision when there is no clear right choice
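The modelling tasks in this list (regression, PCA on standardized data, KMeans clustering, and a per-cluster summary) could be sketched roughly as follows. This is a minimal illustration, not the project's actual solution: it assumes NumPy and scikit-learn are available, and it uses synthetic random numbers in place of the real data files. The array names `features` and `target`, the number of features, and the choice of three clusters are all hypothetical.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in data (NOT the project's data): 51 rows, one per
# state-like unit, and 3 hypothetical numeric features.
rng = np.random.default_rng(0)
features = rng.normal(size=(51, 3))
# Hypothetical target built from the features plus noise, for illustration only.
target = 15 + features @ np.array([1.0, 0.5, -0.3]) + rng.normal(scale=0.5, size=51)

# Fit a multivariate linear regression of the target on all features.
reg = LinearRegression().fit(features, target)

# Standardize the features, then run PCA on the standardized data.
scaled = StandardScaler().fit_transform(features)
pca = PCA()
components = pca.fit_transform(scaled)
explained = pca.explained_variance_ratio_

# Cluster the standardized data with KMeans (3 clusters is an assumption).
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0)
labels = kmeans.fit_predict(scaled)

# Summarize the target within each cluster.
cluster_means = {
    int(k): float(target[labels == k].mean()) for k in np.unique(labels)
}

print("regression coefficients:", reg.coef_)
print("explained variance ratio:", explained)
print("mean target per cluster:", cluster_means)
```

The first two columns of `components` are what a scatter plot of the first two principal components would show, and coloring those points by `labels` gives the cluster view. In practice the number of clusters would be chosen from the data, for example by inspecting how the KMeans inertia changes as the cluster count grows.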
Technologies
Python