Official Blog

python
+1

PySpark Cheat Sheet: Spark in Python

This PySpark cheat sheet with code samples covers the basics like initializing Spark in Python, loading data, sorting, and repartitioning.
r programming
+1

R Correlation Tutorial

Get introduced to the basics of correlation in R: learn more about correlation coefficients, correlation matrices, plotting correlations, etc.
r programming
+1

DataChats: An Interview with Hank Roark

Nick interviews Hank Roark, senior data scientist at Boeing, about his career, his course, and more.
importing & cleaning data
+1

An Introduction to Cleaning Data in R

an introduction to cleaning and editing data in R, including an introduction to understanding the structure of your data as well as visualizing your data
data analysis

Writing Functions in Python

You will learn to do the following: define functions without parameters, define functions with single parameters, and define functions that return a single value.
data visualization
+1

Intermediate Python for Data Science: Matplotlib

Learn to visualize real data with matplotlib's functions.
python
+2

New Course: Supervised Learning with scikit-learn

New course, "Supervised Learning with scikit-learn", taught by Andreas Müller! Become a machine learning master with this course.
r programming
+2

New Course: Unsupervised Learning in R

New machine learning course: "Unsupervised Learning in R", taught by Hank Roark. Start the first chapter for free!