
Classify Suspected Infection in Patients

Classify patients with suspected infections using data.table and electronic health records.

12 Tasks, 1,500 XP


Project Description

Sepsis is a deadly illness that accounts for a large portion of in-hospital deaths. It occurs when a person's organs shut down in response to a severe infection. This public health problem is a major target for research, and hospital records can help us investigate it. In this R project, you will identify hospital patients with severe infection using medical record data. To complete this project successfully, you should have some experience with the `data.table` package, including the `:=` operator, grouped aggregations with `by`, and the `shift` function.
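As a quick refresher on the three prerequisites named above, here is a minimal sketch on a made-up table. The table `abx` and its columns (`patient_id`, `day`, `prev_day`, `gap_days`) are hypothetical and chosen only for illustration; they are not the project's actual data or column names.

```r
library(data.table)

# Hypothetical toy data: one row per patient per day an antibiotic was given.
abx <- data.table(
  patient_id = c(1, 1, 1, 2, 2),
  day        = c(1L, 2L, 5L, 3L, 4L)
)

# Sort so that shift() sees each patient's days in chronological order.
setorder(abx, patient_id, day)

# `:=` adds a column by reference; shift() combined with `by` pulls the
# previous row's value within each patient (NA for a patient's first row).
abx[, prev_day := shift(day), by = patient_id]

# Days elapsed since the previous antibiotic day for the same patient.
abx[, gap_days := day - prev_day]

# Grouped aggregation with `by`: one summary row per patient.
summary_dt <- abx[, .(n_days  = .N,
                      max_gap = max(gap_days, na.rm = TRUE)),
                  by = patient_id]
print(summary_dt)
```

The same pattern (order the rows, lag a column within each patient, then compare or aggregate) is one plausible way to reason about sequences of days per patient, though the project's own tasks may structure it differently.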

Project Tasks

  1. This patient may have sepsis
  2. Which antibiotics are "new"?
  3. Looking at the blood culture data
  4. Combine the antibiotic data and the blood culture data
  5. Determine whether each row is in-window
  6. Check the I.V. requirement
  7. Find the first day of possible sequences
  8. Simplify the data
  9. Extract first four rows for each blood culture
  10. Consecutive sequence
  11. Select the patients who meet criteria
  12. Find the prevalence of sepsis




Data Manipulation, Importing & Cleaning Data

JoAnn Alvarez

Data Scientist at Tenet Healthcare

JoAnn is a data scientist at Tenet Healthcare and previously worked in biostatistics at Vanderbilt Medical School. She studied mathematical statistics at University of Maryland and is passionate about machine learning.
