Course
Foundations of PySpark
Intermediate skill level
Updated March 2025
Spark · Data Engineering · 4 hours · 45 exercises · 3,850 XP · 150K+ · Statement of Accomplishment
Prerequisites
Introduction to Python
1
Getting to know PySpark
In this chapter, you'll learn how Spark manages data and how you can read and write tables from Python.
2
Manipulating data
In this chapter, you'll learn about the pyspark.sql module, which provides optimized data queries to your Spark session.
3
Getting started with machine learning pipelines
PySpark has built-in, cutting-edge machine learning routines, along with utilities to create full machine learning pipelines. You'll learn about them in this chapter.
4
Model tuning and selection
In this last chapter, you'll apply what you've learned to create a model that predicts which flights will be delayed.