Mike Metzger
Data Engineer Consultant @ Flexible Creations
Join us for this live, hands-on training where you will learn how to utilize the power of Python and Apache Spark for cleaning data. We'll work through a dataset with a myriad of common issues you would likely encounter while preparing the data for further processing or analysis. This includes handling malformed and missing data, using transformations, and a bit about validation of your datasets. This session will run for three hours, providing time to gain experience with Spark and data cleaning and will include short breaks and Q&A throughout.
You will learn how to:
Bring your questions regarding processing large amounts of data and a machine running a late version browser.
This course is open to all DataCamp Premium learners, looking to use Spark and Python to chew through and clean huge datasets. We recommend that you have taken the following course before attending:
Data Engineer Consultant @ Flexible Creations