Skip to main content
learning data science

Data Science Tutorials

Develop your data science skills with tutorials in our blog. We cover everything from intricate data visualizations in Tableau to version control features in Git.
Discover categories
R Programming

Pipes in R Tutorial For Beginners

Learn more about the famous pipe operator %>% and other pipes in R, why and how you should use them and what alternatives you can consider!

Karlijn Willems

November 16, 2017

Python

Web Scraping & NLP in Python

Learn to scrape novels from the web and plot word frequency distributions; You will gain experience with Python packages requests, BeautifulSoup and nltk.
Hugo Bowne-Anderson's photo

Hugo Bowne-Anderson

October 13, 2017

Python

Hierarchical indices, groupby and pandas

In this tutorial, you’ll learn about multi-indices for pandas DataFrames and how they arise naturally from groupby operations on real-world data sets.
Hugo Bowne-Anderson's photo

Hugo Bowne-Anderson

October 2, 2017

Python

Groupby, split-apply-combine and pandas

In this tutorial, you'll learn how to use the pandas groupby operation, which draws from the well-known split-apply-combine strategy, on Netflix movie data.
Hugo Bowne-Anderson's photo

Hugo Bowne-Anderson

September 26, 2017

Python

Poker Probability and Statistics with Python

Tackle probability and statistics in Python: learn more about combinations and permutations, dependent and independent events, and expected value.
Daniel Poston's photo

Daniel Poston

September 21, 2017

Python

Graph Optimization with NetworkX in Python

This NetworkX tutorial will show you how to do graph optimization in Python by solving the Chinese Postman Problem in Python.
Andrew Brooks's photo

Andrew Brooks

September 12, 2017

Python

Detecting Fake News with Scikit-Learn

This scikit-learn tutorial will walk you through building a fake news classifier with the help of Bayesian models.
Katharine Jarmul's photo

Katharine Jarmul

August 24, 2017

Python

Python Seaborn Tutorial For Beginners

This Seaborn tutorial introduces you to the basics of statistical data visualization
Karlijn Willems's photo

Karlijn Willems

August 10, 2017

Python

Apache Spark Tutorial: ML with PySpark

Apache Spark tutorial introduces you to big data processing, analysis and ML with PySpark.
Karlijn Willems's photo

Karlijn Willems

July 28, 2017

Python

Scikit-Learn Tutorial: Baseball Analytics Pt 2

A Scikit-Learn tutorial to using logistic regression and random forest models to predict which baseball players will be voted into the Hall of Fame
Daniel Poston's photo

Daniel Poston

June 20, 2017