Premium project

Kidney Stones and Simpson's Paradox

Use logistic regression to determine which treatment procedure is more effective for kidney stone removal.

Start Project
8 Tasks1,500 XP

Loved by learners at thousands of companies


Project Description

In this project, you will work with medical data published in 1986 in "The British Medical Journal" where the effectiveness of two types of kidney stone removal treatments (A - open surgery and B - percutaneous nephrolithotomy) were compared. You will use multiple logistic regression and visualize model output to help the doctors determine if there is a difference between the two treatments. While not required, it will also help to have some knowledge of inferential statistics. The dataset used in this project is simulated based on the original medical paper published [here](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1339981/).

Project Tasks

  1. 1
    A new look at an old research study
  2. 2
    Recreate the Treatment X Success summary table
  3. 3
    Bringing stone size into the picture
  4. 4
    When in doubt, rely on a plot
  5. 5
    Identify and confirm the lurking variable
  6. 6
    Remove the confounding effect
  7. 7
    Visualize model output
  8. 8
    Generate insights

Technologies

R R

Topics

Data VisualizationProbability & StatisticsCase Studies
Amy Yang Headshot

Amy Yang

Senior Data Scientist at Uptake

Amy Yang is a Sr. Data Scientist at Uptake where she conducts industrial analytics and build prediction models to major industries and help them increase productivity, security, safety, and reliability. She began using R for simulation and statistical analysis during her study at the University of Pennsylvania where she received her MS degree in Biostatistics. She also teaches R programming and statistical courses for graduate students.
See More

What do other learners have to say?

I've used other sites—Coursera, Udacity, things like that—but DataCamp's been the one that I've stuck with.

Devon Edwards Joseph
Lloyds Banking Group

DataCamp is the top resource I recommend for learning data science.

Louis Maiden
Harvard Business School

DataCamp is by far my favorite website to learn from.

Ronald Bowers
Decision Science Analytics, USAA