Data Science Tutorials

Develop your data science skills with tutorials on our blog. We cover everything from intricate data visualizations in Tableau to version control features in Git.

SARIMA: A Complete Guide to Seasonal Time Series Forecasting

Learn how SARIMA extends ARIMA to handle seasonality, understand its seven parameters, and build a working model in Python from data collection through forecasting.

Vinod Chugani

July 31, 2026

Silhouette Score: How to Evaluate Clustering Quality

A hands-on guide to the silhouette score, covering the formula, interpretation ranges, a scikit-learn example, how to use it for choosing the right number of clusters, and how it compares to other clustering metrics.

July 31, 2026

A Complete Guide to Nano Banana 2 Lite and Gemini Omni Flash: Python Media Pipelines

Learn how to leverage Google's latest multimodal models to build high-performance, cost-effective image and video generation pipelines in Python.

François Aubry

July 30, 2026

How to Implement Semantic Search in MongoDB

Learn how to implement semantic search in MongoDB with Python. Generate vector embeddings, create a Vector Search index, and run $vectorSearch queries.

Damilola Oladele

July 30, 2026

Grok in Excel: A Hands-On Guide With 6 Practical Examples

Learn how to use Grok in Microsoft Excel to analyze data, generate formulas, clean spreadsheets, create charts, and run financial scenarios using natural-language prompts.

Khalid Abdelaty

July 30, 2026

Spurious Correlation: An Important Statistical Trap (and How to Avoid It)

Knowing why spurious relationships happen, from confounders to sampling bias, is what separates a real finding from a statistical coincidence.

Dario Radečić

July 29, 2026

Hyperparameter Tuning: Methods, Best Practices, and Examples

A practical guide to hyperparameter tuning, covering how it works, the main search methods (grid, random, Bayesian), evaluation strategies, and how to run it in Python and R.

Dario Radečić

July 28, 2026

How to Run Bonsai 27B Locally on 8GB Memory

Learn how to run Bonsai 27B, a compact 1-bit Qwen3.6-27B model, locally using llama.cpp, an OpenAI-compatible API, a web UI, and the Pi coding agent.

Abid Ali Awan

July 27, 2026

RAGFlow Explained: Build Production RAG Applications

A practical walkthrough of RAGFlow, the open-source platform for building production Retrieval Augmented Generation applications, covering its architecture, document parsing engine, retrieval strategies, agent workflows, and how it compares to LangChain and LlamaIndex.

Dario Radečić

July 27, 2026

The Claude Opus 5 API: Build and Benchmark a Coding Agent in Python

Learn how to use the Claude Opus 5 API in Python. Build a bug-fixing agent with tools, then benchmark all five effort levels on pass rate, tokens, and cost.

Khalid Abdelaty

July 27, 2026

Orchestrating AI Pipelines With Airflow

Learn how to turn a simple LLM prototype into a reliable, scheduled AI workflow with Apache Airflow.

Andrea Valenzuela

July 26, 2026

How to Run Thinking Machines’ Inkling Locally

Learn how to run Thinking Machines' Inkling locally. A complete guide to building experimental llama.cpp, downloading Unsloth GGUFs, and multi-GPU serving.

Abid Ali Awan

July 26, 2026