Master the basics of data analysis by manipulating common data structures such as vectors, matrices and data frames.

- 337,126 Students
- 62 Exercises
- 0 Videos
- 4 hours
- 6200 XP

- Intro to basics
- Vectors
- Matrices
- Factors
- Data frames

Master the basics of data analysis in Python. Expand your data science skill set by learning scientific computing with numpy.

- 127,947 Students
- 57 Exercises
- 11 Videos
- 4 hours
- 4700 XP

- Python Basics
- Python Lists
- Functions and Packages
- Numpy

Continue your journey to become an R ninja by learning about conditional statements, loops, and vector functions.

- 105,397 Students
- 81 Exercises
- 14 Videos
- 6 hours
- 7150 XP

- Conditionals and Control Flow
- Loops
- Functions
- The apply family
- Utilities

Level up your Python data science skills by creating visualizations with matplotlib and manipulating data frames with pandas.

- 45,786 Students
- 87 Exercises
- 18 Videos
- 4 hours
- 7400 XP

- Matplotlib
- Dictionaries & Pandas
- Logic, Control Flow and Filtering
- Loops
- Case Study: Hacker Statistics

Learn to produce meaningful and beautiful data visualizations with ggplot2 by understanding the grammar of graphics.

- 29,966 Students
- 62 Exercises
- 14 Videos
- 5 hours
- 5250 XP

- Introduction
- Data
- Aesthetics
- Geometries
- qplot and wrap-up

Learn to train and assess models performing common machine learning tasks such as classification, regression and clustering.

- 37,028 Students
- 81 Exercises
- 15 Videos
- 6 hours
- 6700 XP

- What is Machine Learning
- Performance measures
- Classification
- Regression
- Clustering

Master fundamental techniques for data manipulation using the select, mutate, filter, arrange, and summarise functions in dplyr.

- 41,171 Students
- 42 Exercises
- 11 Videos
- 4 hours
- 3400 XP

- Introduction to dplyr and tbls
- Select and mutate
- Filter and arrange
- Summarise and the pipe operator
- Group_by and working with databases

Learn about datasets in R, study types, sampling strategies, and experimental design.

- 3,873 Students
- 45 Exercises
- 15 Videos
- 4 hours
- 3150 XP

- Language of data
- Study types and cautionary tales
- Sampling strategies and experimental design
- Case study

Learn to explore your data so you can properly clean and prepare it for analysis.

- 25,070 Students
- 59 Exercises
- 16 Videos
- 4 hours
- 4750 XP

- Introduction and exploring raw data
- Tidying data
- Preparing data for analysis
- Putting it all together

Say you have a collection of customers with a variety of characteristics such as age, location, and financial history, and you ...

- 797 Students
- 52 Exercises
- 13 Videos
- 4 hours
- 4150 XP

- Clustering for dataset exploration
- Visualization with hierarchical clustering and t-SNE
- Decorrelating your data and dimension reduction
- Discovering interpretable features

Learn to import data into Python from various sources, such as Excel, SQL, SAS and right from the web.

- 9,023 Students
- 54 Exercises
- 15 Videos
- 3 hours
- 4150 XP

- Introduction and flat files
- Importing data from other file types
- Working with relational databases in Python

In this second course in the Python Data Science Toolbox, you'll continue to build your Python Data Science skills. First you'l...

- 779 Students
- 46 Exercises
- 12 Videos
- 4 hours
- 3700 XP

- Using iterators in PythonLand
- List comprehensions and generators
- Bringing it all together!

Learn how to parse data in any format. Whether it's flat files, statistics software, databases, or web data, you'll handle it all.

- 7,987 Students
- 42 Exercises
- 11 Videos
- 3 hours
- 3600 XP

- Importing data from flat files with utils
- readr & data.table
- Importing Excel data
- Reproducible Excel work with XLConnect

Learn how to describe relationships between two numerical quantities and characterize these relationships graphically.

- 1,930 Students
- 58 Exercises
- 18 Videos
- 4 hours
- 4200 XP

- Correlation and regression
- Correlation
- Simple linear regression
- Interpreting regression models
- Model Fit

In this course, you'll learn how to use the industry-standard pandas library to import, build, and manipulate DataFrames.

- 3,455 Students
- 62 Exercises
- 15 Videos
- 4 hours
- 5100 XP

- Data ingestion & inspection
- Exploratory data analysis
- Time series in pandas
- Case Study - Sunlight in Austin

Learn the basics of the important features of the RStudio IDE.

- 3,486 Students
- 69 Exercises
- 24 Videos
- 3 hours
- 3450 XP

- Orientation
- Programming
- Projects

Learn the core techniques necessary to extract meaningful insights from time series data.

- 3,143 Students
- 58 Exercises
- 16 Videos
- 4 hours
- 4600 XP

- Exploratory time series data analysis
- Predicting the future
- Correlation analysis and the autocorrelation function
- Autoregression
- A simple moving average

Learn the art of writing your own functions in Python, as well as key concepts like scoping and error handling.

- 6,724 Students
- 46 Exercises
- 12 Videos
- 3 hours
- 3650 XP

- Writing your own functions
- Default arguments, variable-length arguments and scope
- Lambda functions and error-handling

Learn the fundamentals of writing functions in R so you can make your code more readable and automate repetitive tasks.

- 19,372 Students
- 86 Exercises
- 19 Videos
- 4 hours
- 7250 XP

- A quick refresher
- When and how you should write a function
- Functional programming
- Advanced inputs and outputs
- Robust functions

Master core concepts in data manipulation such as subsetting, updating, indexing and joining your data using data.table.

- 37,655 Students
- 37 Exercises
- 10 Videos
- 4 hours
- 2900 XP

- Data.table novice
- Data.table yeoman
- Data.table expert

Learn to create interactive analyses and automated reports with R Markdown and communicate your insights quickly and effectively.

- 32,069 Students
- 31 Exercises
- 8 Videos
- 3 hours
- 2550 XP

- Authoring R Markdown Reports
- Embedding Code
- Compiling Reports
- Configuring R Markdown (optional)

This course teaches the big ideas in machine learning, such as how to build and evaluate predictive models.

- 5,340 Students
- 88 Exercises
- 24 Videos
- 4 hours
- 6250 XP

- Regression models: fitting them and evaluating their performance
- Classification models: fitting them and evaluating their performance
- Tuning model parameters to improve performance
- Preprocessing your data
- Selecting models: a case study in churn prediction

This course was designed to get you up to speed with the most important and powerful methodologies in statistics.

- 3,774 Students
- 43 Exercises
- 10 Videos
- 4 hours
- 3800 XP

- What is statistical modeling?
- Designing, training, and evaluating models
- Assessing prediction performance
- Exploring data with models
- Covariates and effect size

Take your data visualization skills to the next level with coordinates, facets, themes, and general best practices in ggplot2.

- 19,355 Students
- 55 Exercises
- 11 Videos
- 5 hours
- 4750 XP

- Statistics
- Coordinates and Facets
- Themes
- Best Practices
- Case Study

In this course, you'll learn the basics of relational databases and why they're everywhere, along with how to interact with them.

- 3,273 Students
- 66 Exercises
- 20 Videos
- 4 hours
- 5450 XP

- Basics of Relational Databases
- Applying Filtering, Ordering and Grouping to Queries
- Advanced SQLAlchemy Queries
- Creating and Manipulating your own Databases
- Putting it all together

Strengthen your knowledge of the topics you learned in Intermediate R with a ton of new and fun exercises.

- 12,637 Students
- 52 Exercises
- 0 Videos
- 4 hours
- 4700 XP

- Conditionals and Control Flow
- Loops
- Functions
- The apply family
- Utilities

After all of the hard work of acquiring data and getting them into a form you can work with, you ultimately want to make clear,...

- 3,752 Students
- 61 Exercises
- 18 Videos
- 3 hours
- 4550 XP

- Graphical exploratory data analysis
- Quantitative exploratory data analysis
- Thinking probabilistically-- Discrete variables
- Thinking probabilistically-- Continuous variables

In this course, you will become an expert in fitting ARIMA models to time series data using R. First, you will explore the nat...

- 1,396 Students
- 45 Exercises
- 13 Videos
- 4 hours
- 3600 XP

- Time Series Data and Models
- Fitting ARMA models
- ARIMA Models
- Seasonal ARIMA

R supports four different graphics systems: base graphics, grid graphics, lattice graphics, and ggplot2. Base graphics is the d...

- 2,002 Students
- 60 Exercises
- 15 Videos
- 4 hours
- 5250 XP

- A quick introduction to base R graphics
- Different plot types
- Adding details to plots
- How much is too much?
- Advanced plot customization and beyond

Learn how to use graphical and numerical techniques to uncover the structure of your data and find variables that suggest inter...

- 1,336 Students
- 54 Exercises
- 15 Videos
- 4 hours
- 3950 XP

- Exploring Categorical Data
- Exploring Numerical Data
- Numerical Summaries
- Case Study

Learn the bag of words technique for text mining with R.

- 12,662 Students
- 69 Exercises
- 15 Videos
- 4 hours
- 5700 XP

- Jumping into text mining with bag of words
- Word clouds and more interesting visuals
- Adding to your tm skills
- Battle of the tech giants for talent

This course extends Intermediate Python for Data Science to provide a stronger foundation in data visualization in Python. The ...

- 1,755 Students
- 58 Exercises
- 14 Videos
- 4 hours
- 5000 XP

- Customizing plots
- Plotting 2D arrays
- Statistical plots with Seaborn
- Analyzing time series and images

In this course, you'll learn how to leverage pandas' extremely powerful data manipulation engine to get the most out of your da...

- 1,463 Students
- 75 Exercises
- 19 Videos
- 4 hours
- 6300 XP

- Extracting and transforming data
- Advanced indexing
- Rearranging and reshaping data
- Grouping data
- Bringing it all together

This course will show you how to combine data sets with dplyr's two-table verbs.

- 2,989 Students
- 84 Exercises
- 20 Videos
- 4 hours
- 6550 XP

- Mutating joins
- Filtering joins and set operations
- Assembling data
- Advanced joining
- Case study

This course covers the basics of financial trading and how to use quantstrat to build signal-based trading strategies.

- 4,865 Students
- 65 Exercises
- 20 Videos
- 5 hours
- 5150 XP

- Trading basics
- A boilerplate for quantstrat strategies
- Indicators
- Signals
- Rules

Learn to apply statistical modeling in a real-life setting using logistic regression and decision trees to model credit risk.

- 16,490 Students
- 52 Exercises
- 16 Videos
- 4 hours
- 4000 XP

- Introduction and data preprocessing
- Logistic regression
- Decision trees
- Evaluating a credit risk model

Further improve your Python importing skills and learn to work with more web and API data.

- 2,930 Students
- 29 Exercises
- 7 Videos
- 2 hours
- 2400 XP

- Importing data from the Internet
- Interacting with APIs to import data from the web
- Diving deep into the Twitter API

The xts and zoo packages make managing and manipulating ordered observations fast and mistake-free.

- 4,678 Students
- 55 Exercises
- 15 Videos
- 4 hours
- 4500 XP

- Introduction to eXtensible Time Series, using xts and zoo for time series
- First Order of Business - Basic Manipulations
- Merging and modifying time series
- Apply and aggregate by time
- Extra features of xts

Apply your finance and R skills to backtest, analyze, and optimize financial portfolios.

- 9,652 Students
- 57 Exercises
- 14 Videos
- 5 hours
- 4450 XP

- The building blocks
- Analyzing performance
- Performance drivers
- Optimizing the portfolio

Learn how to parse data in any format. Whether it's flat files, statistics software, databases, or web data, you'll handle it all.

- 2,201 Students
- 48 Exercises
- 10 Videos
- 3 hours
- 3950 XP

- Importing data from databases (Part 1)
- Importing data from databases (Part 2)
- Importing data from the web (Part 1)
- Importing data from the web (Part 2)
- Importing data from statistical software packages

Learn how to create versatile, data-driven graphics using Bokeh.

- 2,859 Students
- 75 Exercises
- 20 Videos
- 4 hours
- 6100 XP

- Basic plotting with Bokeh
- Layouts, Interactions, and Annotations
- High-level Charts
- Building interactive apps with Bokeh
- Putting It All Together! A Case Study

Learn about the practice of drawing conclusions about a larger population from a sample of data: a process known as statistical...

- 604 Students
- 58 Exercises
- 17 Videos
- 4 hours
- 4400 XP

- Introduction to ideas of inference
- Completing a randomization test: gender discrimination
- Hypothesis testing errors: opportunity cost
- Confidence intervals

Further your knowledge of RStudio and learn how to integrate Git, LaTeX, and Shiny.

- 1,034 Students
- 77 Exercises
- 29 Videos
- 3 hours
- 3850 XP

- Packages
- Version Control
- Reporting

Learn to create interactive graphs to display distributions, relationships, model fits, and more using ggvis.

- 35,421 Students
- 45 Exercises
- 11 Videos
- 4 hours
- 3700 XP

- The Grammar of Graphics
- Lines and Syntax
- Transformations
- Interactivity and Layers
- Customizing Axes, Legends, and Scales

Where should you buy a house to get the most value for your money? Your first step might be to make a map, but spatial data can...

- 2,156 Students
- 58 Exercises
- 15 Videos
- 4 hours
- 5000 XP

- Basic mapping with ggplot2 and ggmap
- Point and polygon data
- Raster data and color
- Data import and projections

Manage the complexity in your code using object-oriented programming with the S3 and R6 systems.

- 1,755 Students
- 56 Exercises
- 18 Videos
- 4 hours
- 4300 XP

- Introduction to Object-Oriented Programming
- Using S3
- Using R6
- R6 Inheritance
- Advanced R6 Usage

Once you've started learning tools for data manipulation and visualization like dplyr and ggplot2, this course gives you a chan...

- 1,451 Students
- 58 Exercises
- 15 Videos
- 4 hours
- 4800 XP

- Data cleaning and summarizing with dplyr
- Data visualization with ggplot2
- Tidy modeling with broom
- Joining and tidying

After completing Statistical Thinking in Python (Part 1), you have the probabilistic mindset and foundational hacker stats skil...

- 1,121 Students
- 67 Exercises
- 15 Videos
- 4 hours
- 5450 XP

- Parameter estimation by optimization
- Bootstrap confidence intervals
- Introduction to hypothesis testing
- Hypothesis test examples
- Putting it all together: a case study

Learn to translate your knowledge of SAS, SPSS, or Stata into R using the same statistics techniques you're familiar with.

- 28,766 Students
- 196 Exercises
- 52 Videos
- 16 hours
- 14450 XP

- Introduction
- Installing & Maintaining R
- Help & Documentation
- RStudio Basics
- Programming Language Basics

This course covers some advanced topics including strategies for handling large data sets and specialty plots.

- 1,057 Students
- 87 Exercises
- 19 Videos
- 6 hours
- 7650 XP

- Statistical plots
- Plots for specific data types (Part 1)
- Plots for specific data types (Part 2)
- ggplot2 Internals
- Data Munging and Visualization Case Study

In this series of four case studies, you'll revisit key concepts from our courses on importing and cleaning data in R.

- 3,020 Students
- 35 Exercises
- 0 Videos
- 4 hours
- 3500 XP

- Ticket Sales Data
- MBTA Ridership Data
- World Food Facts
- School Attendance Data

In this follow-up course, you will expand your stat modeling skills from part 1 and dive into more advanced concepts.

- 903 Students
- 41 Exercises
- 10 Videos
- 4 hours
- 3600 XP

- Effect size and interaction
- Total and partial change
- Sampling variability and mathematical transforms
- Variables working together

Why value bonds? Bonds are securities issued by governments or corporations that pay interest over a fixed schedule and are the...

- 716 Students
- 43 Exercises
- 13 Videos
- 4 hours
- 3250 XP

- Introduction and Plain Vanilla Bond Valuation
- Yield to Maturity
- Duration and Convexity
- Comprehensive Example

This follow-up course on manipulating time series data in R does not cover new data manipulation concepts. Instead, you will st...

- 901 Students
- 50 Exercises
- 12 Videos
- 4 hours
- 4000 XP

- Flight Data
- Weather Data
- Economic Data
- Sports Data

Use a rich baseball dataset from the MLB's Statcast system to practice your data exploration skills.

- 5,024 Students
- 69 Exercises
- 14 Videos
- 4 hours
- 6200 XP

- Exploring pitch velocities
- Exploring pitch types
- Exploring pitch locations
- Exploring batted ball outcomes

