must read
importing & cleaning data
+2

Generating Realistic Random Datasets with Trumania

Why do data scientists and data engineers work with synthetic data? How do they obtain it? Discover Trumania, a scenario-based random dataset generator library.
must read
importing & cleaning data
+2
25
25
must read
fun
+5

Lyric Analysis with NLP & Machine Learning with R

Dive into the lyrics of Prince's music with R: use text mining and Exploratory Data Analysis (EDA) to shed insight on The Artist's career.
158
158
must read
machine learning

Transfer Learning: Leverage Insights from Big Data

In this tutorial, you’ll see what transfer learning is, what some of its applications are and why it is critical skill as a data scientist.
must read
machine learning
46
46
must read
python
+3

Time Series Analysis Tutorial with Python

Get Google Trends data of keywords such as 'diet' and 'gym' and see how they vary over time while learning about trends and seasonality in time series data.
112
112
must read
shell

8 Useful Shell Commands For Data Science

Which shell commands do data scientists use nearly every day? Discover and learn how to use them in this tutorial!
133
133
must read
machine learning
+4

Kaggle Tutorial: Your First Machine Learning Model

Learn how to build your first machine learning model, a decision tree classifier, with the Python scikit-learn package, submit it to Kaggle and see how it performs!
109
109
must read
r programming

Five Tips to Improve Your R Code

Five useful tips that you can use to effectively improve your R code, from using seq() to create sequences to ditching which() and much more!
must read
r programming
124
124
must read
r programming
+2

Pipes in R Tutorial For Beginners

Learn more about the famous pipe operator %>% and other pipes in R, why and how you should use them and what alternatives you can consider!
157
157
must read
data manipulation
+2

Groupby, split-apply-combine and pandas

In this tutorial, you'll learn how to use the pandas groupby operation, which draws from the well-known split-apply-combine strategy, on Netflix movie data.
108
108
must read
machine learning
+4

Detecting Fake News with Scikit-Learn

This scikit-learn tutorial will walk you through building a fake news classifier with the help of Bayesian models.
must read
machine learning
+4
93
93
must read
r programming
+1

15 Easy Solutions To Your Data Frame Problems In R

Discover how to create a data frame in R, change column and row names, access values, attach data frames, apply functions and much more.
54
54
must read
learning data science
+1

The Data Science Industry: Who Does What (Infographic)

This infograph compares the roles of data scientists, data analysts, data architects, data engineers and more in the data science industry.
must read
learning data science
+1
64
64