跳至内容
首页Python

学习路径

Big Data with PySpark

更新时间 2026年3月
Master how to process big data and leverage it efficiently with Apache Spark using the PySpark API.
免费开始学习路径
PythonImporting & Cleaning Data25 小时8,447

创建您的免费帐户

继续操作即表示您接受我们的《使用条款》和《隐私政策》,并同意您的数据存储在美国。

深受数千家公司学习者的喜爱

Group

培训2人或更多?

试用DataCamp for Business

学习路径描述

Big Data with PySpark

Advance your data skills by mastering Apache Spark. Using the Spark Python API, PySpark, you will leverage parallel computation with large datasets, and get ready for high-performance machine learning. From cleaning data to creating features and implementing machine learning models, you'll execute end-to-end workflows with Spark. The track ends with building a recommendation engine using the popular MovieLens dataset and the Million Songs dataset.

先决条件

此学习路径无先决条件
  • Course

    1

    Introduction to PySpark

    Master PySpark to handle big data with ease—learn to process, query, and optimize massive datasets for powerful analytics!

  • Course

    Learn the gritty details that data scientists are spending 70-80% of their time on; data wrangling and feature engineering.

  • Course

    Learn how to make predictions from data with Apache Spark, using decision trees, logistic regression, linear regression, ensembles, and pipelines.

  • Project

    额外

    Building a Demand Forecasting Model

    Use PySpark to build an e-commerce forecasting model!

Big Data with PySpark
6 课程
学习路径完成

获得成就证明

将此证书添加到你的 LinkedIn 档案、简历或履历中
在社交媒体和绩效评估中分享
立即注册

加入超过19百万学习者,今天就开始Big Data with PySpark !

创建您的免费帐户

继续操作即表示您接受我们的《使用条款》和《隐私政策》,并同意您的数据存储在美国。

通过 DataCamp for Mobile 提升您的数据技能

随时随地通过我们的移动课程和每日 5 分钟编程挑战提升技能。