Data Science Tutorials
Develop your data science skills with tutorials in our blog. We cover everything from intricate data visualizations in Tableau to version control features in Git.
Discover categories
Training 2 or more people?Try DataCamp for Business
Web Scraping & NLP in Python
Learn to scrape novels from the web and plot word frequency distributions; You will gain experience with Python packages requests, BeautifulSoup and nltk.
Hugo Bowne-Anderson
October 13, 2017
Hierarchical indices, groupby and pandas
In this tutorial, you’ll learn about multi-indices for pandas DataFrames and how they arise naturally from groupby operations on real-world data sets.
Hugo Bowne-Anderson
March 31, 2023
Groupby, split-apply-combine and pandas
In this tutorial, you'll learn how to use the pandas groupby operation, which draws from the well-known split-apply-combine strategy, on Netflix movie data.
Hugo Bowne-Anderson
September 26, 2017
Poker Probability and Statistics with Python
Tackle probability and statistics in Python: learn more about combinations and permutations, dependent and independent events, and expected value.
Daniel Poston
September 21, 2017
Graph Optimization with NetworkX in Python
This NetworkX tutorial will show you how to do graph optimization in Python by solving the Chinese Postman Problem in Python.
Andrew Brooks
September 12, 2017
Detecting Fake News with Scikit-Learn
This scikit-learn tutorial will walk you through building a fake news classifier with the help of Bayesian models.
Katharine Jarmul
August 24, 2017
Python Seaborn Tutorial For Beginners: Start Visualizing Data
This Seaborn tutorial introduces you to the basics of statistical data visualization
Moez Ali
March 15, 2023
Apache Spark Tutorial: ML with PySpark
Apache Spark tutorial introduces you to big data processing, analysis and ML with PySpark.
Karlijn Willems
July 28, 2017
Scikit-Learn Tutorial: Baseball Analytics Pt 2
A Scikit-Learn tutorial to using logistic regression and random forest models to predict which baseball players will be voted into the Hall of Fame
Daniel Poston
June 20, 2017
5 Tips to Write Idiomatic Pandas Code
This tutorial covers 5 ways in which you can easily write pandorable or more idiomatic Pandas code.
DataCamp Team
May 29, 2017
Asyncio: An Introduction
A short introduction to asynchronous I/O with the asyncio package.
DataCamp Team
May 8, 2017
Scikit-Learn Tutorial: Baseball Analytics Pt 1
A scikit-learn tutorial to predicting MLB wins per season by modeling data to KMeans clustering model and linear regression models.
Daniel Poston
May 4, 2017