Category
Topics
Machine Learning Tutorial
Get insights & best practices into AI & machine learning, upskill, and build data cultures. Learn how to get the most out of machine learning models with our tutorials.
Other topics:
Training 2 or more people?Try DataCamp for Business
Web Scraping & NLP in Python
Learn to scrape novels from the web and plot word frequency distributions; You will gain experience with Python packages requests, BeautifulSoup and nltk.
Hugo Bowne-Anderson
October 13, 2017
Detecting Fake News with Scikit-Learn
This scikit-learn tutorial will walk you through building a fake news classifier with the help of Bayesian models.
Katharine Jarmul
August 24, 2017
Apache Spark Tutorial: ML with PySpark
Apache Spark tutorial introduces you to big data processing, analysis and ML with PySpark.
Karlijn Willems
July 28, 2017
Scikit-Learn Tutorial: Baseball Analytics Pt 2
A Scikit-Learn tutorial to using logistic regression and random forest models to predict which baseball players will be voted into the Hall of Fame
Daniel Poston
June 20, 2017
Scikit-Learn Tutorial: Baseball Analytics Pt 1
A scikit-learn tutorial to predicting MLB wins per season by modeling data to KMeans clustering model and linear regression models.
Daniel Poston
May 4, 2017
Deep Learning with Jupyter Notebooks in the Cloud
This step-by-step tutorial will show you how to set up and use Jupyter Notebook on Amazon Web Services (AWS) EC2 GPU for deep learning.
Dan Becker
March 23, 2017
Preprocessing in Data Science (Part 3): Scaling Synthesized Data
You can preprocess the heck out of your data but the proof is in the pudding: how well does your model then perform?
Hugo Bowne-Anderson
May 10, 2016
Preprocessing in Data Science (Part 2): Centering, Scaling and Logistic Regression
Discover whether centering and scaling help your model in a logistic regression setting.
Hugo Bowne-Anderson
May 3, 2016
Preprocessing in Data Science (Part 1): Centering, Scaling, and KNN
This article will explain the importance of preprocessing in the machine learning pipeline by examining how centering and scaling can improve model performance.
Hugo Bowne-Anderson
April 26, 2016