Skip to main content
Category
Topics

Big Data Tutorials

Learn how to implement the tools that data scientists are using to handle, process, & analyse big data. Gain insight to help upskill your career & team.
Other topics:
Big Data

PySpark: How to Drop a Column From a DataFrame

In PySpark, we can drop one or more columns from a DataFrame using the .drop("column_name") method for a single column or .drop(["column1", "column2", ...]) for multiple columns.

Maria Eugenia Inzaugarat

June 16, 2024

Data Engineering

Building an ETL Pipeline with Airflow

Master the basics of extracting, transforming, and loading data with Apache Airflow.
Jake Roach's photo

Jake Roach

May 3, 2024

Big Data

Snowflake Snowpark: A Comprehensive Introduction

Take the first steps to master in-database machine learning using Snowflake Snowpark.
Bex Tuychiev's photo

Bex Tuychiev

May 2, 2024

Big Data

A Beginner's Guide to BigQuery

Learn what BigQuery is, how it works, its differences from traditional data warehouses, and how to use the BigQuery console to query public datasets provided by Google.
Eduardo Oliveira's photo

Eduardo Oliveira

September 13, 2023

Python

Lasso and Ridge Regression in Python Tutorial

Learn about the lasso and ridge techniques of regression. Compare and analyse the methods in detail.
DataCamp Team's photo

DataCamp Team

March 25, 2022

Big Data

Cloudera Hadoop Tutorial

Learn about Hadoop ecosystem, the architectures and how to start with Cloudera.
DataCamp Team's photo

DataCamp Team

March 4, 2022

SQL

How to Use GROUP BY and HAVING in SQL

An intuitive guide for discovering the two most popular SQL commands to aggregate rows of your dataset
Eugenia Anello's photo

Eugenia Anello

February 21, 2023

Python

Python Dictionary Comprehension Tutorial

Learn all about Python dictionary comprehension: how you can use it to create dictionaries, to replace (nested) for loops or lambda functions with map(), filter() and reduce(), ...!
Sejal Jaiswal's photo

Sejal Jaiswal

February 27, 2023