Skip to main content

RNA-Seq with Bioconductor in R

Use RNA-Seq differential expression analysis to identify genes likely to be important for different diseases or conditions.

Start Course for Free
4 Hours16 Videos44 Exercises11,640 Learners3150 XPAnalyzing Genomic Data Track

Create Your Free Account



By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA. You confirm you are at least 16 years old (13 if you are an authorized Classrooms user).

Loved by learners at thousands of companies

Course Description

RNA-Seq is an exciting next-generation sequencing method used for identifying genes and pathways underlying particular diseases or conditions. As high-throughput sequencing becomes more affordable and accessible to a wider community of researchers, the knowledge to analyze this data is becoming an increasingly valuable skill. Join us in learning about the RNA-Seq workflow and discovering how to identify which genes and biological processes may be important for your condition of interest! We will start the course with a brief overview of the RNA-Seq workflow with an emphasis on differential expression (DE) analysis. Starting with the counts for each gene, the course will cover how to prepare data for DE analysis, assess the quality of the count data, and identify outliers and detect major sources of variation in the data. The DESeq2 R package will be used to model the count data using a negative binomial model and test for differentially expressed genes. Visualization of the results with heatmaps and volcano plots will be performed and the significant differentially expressed genes will be identified and saved.

  1. 1

    Introduction to RNA-Seq theory and workflow


    In this chapter we explore what we can do with RNA-Seq data and why it is exciting. We learn about the different steps and considerations involved in an RNA-Seq workflow.

    Play Chapter Now
    Introduction to RNA-Seq
    50 xp
    Core Concepts
    50 xp
    RNA-Seq Packages
    100 xp
    RNA-Seq Workflow
    50 xp
    Read Alignments
    50 xp
    Exploring the raw count matrix
    100 xp
    Differential gene expression overview
    50 xp
    DGE Theory
    50 xp
    DGE Theory: Metadata
    100 xp
  2. 4

    Exploration of differential expression results

    In this final chapter we explore the differential expression results using visualizations, such as heatmaps and volcano plots. We also review the steps in the analysis and summarize the differential expression workflow with DESeq2.

    Play Chapter Now

In the following tracks

Analyzing Genomic Data


dcamposlizDavid CamposrichieRichie CottonshoninouyeShon Inouye
Mary Piper Headshot

Mary Piper

Bioinformatics Consultant and Trainer

Mary Piper serves dual roles as research analyst and bioinformatics trainer in the Department of Biostatistics at the Harvard T.H. Chan School of Public Health. However, her primary role is the development and instruction of bioinformatics workshops focused on the analysis of next-generation sequencing data. She has a PhD in cellular and molecular biology from the University of Michigan and a background in science education. Her passion for bioinformatics research and teaching led to her desire to pursue bioinformatics as a career and to share that knowledge with the community.
See More

What do other learners have to say?

I've used other sites—Coursera, Udacity, things like that—but DataCamp's been the one that I've stuck with.

Devon Edwards Joseph
Lloyds Banking Group

DataCamp is the top resource I recommend for learning data science.

Louis Maiden
Harvard Business School

DataCamp is by far my favorite website to learn from.

Ronald Bowers
Decision Science Analytics, USAA