This PySpark cheat sheet with code samples covers the basics like initializing Spark in Python, loading data, sorting, and repartitioning.
PySpark Cheat Sheet: Spark in Python
Get introduced to the basics of correlation in R: learn more about correlation coefficients, correlation matrices, plotting correlations, etc.
R Correlation Tutorial
Nick interviews Hank Roark, senior data scientist at Boeing, about his career, his course, and more.
DataChats: An Interview with Hank Roark
an introduction to cleaning and editing data in R, including an introduction to understanding the structure of your data as well as visualizing your data
importing & cleaning data+1
An Introduction to Cleaning Data in R
You will learn to do the following: define functions without parameters, define functions with single parameters, and define functions that return a single value.
Writing Functions in Python
Learn to visualize real data with matplotlib's functions.
Intermediate Python for Data Science: Matplotlib
New course, "Supervised Learning with scikit-learn", taught by Andreas Müller! Become a machine learning master with this course.
New Course: Supervised Learning with scikit-learn
New machine learning course: "Unsupervised Learning in R", taught by Hank Roark. Start the first chapter for free!