Course
Scalable Data Processing in R
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.Loved by learners at thousands of companies
Training 2 or more people?
Try DataCamp for BusinessCourse Description
Prerequisites
Writing Efficient R CodeWorking with increasingly large data sets
Processing and Analyzing Data with bigmemory
Working with iotools
Case Study: A Preliminary Analysis of the Housing Data
Complete
Earn Statement of Accomplishment
Add this credential to your LinkedIn profile, resume, or CVShare it on social media and in your performance reviewEnroll Now
FAQs
What real-world dataset is used throughout this course?
The course uses Federal Housing Finance Agency data covering all mortgages held or securitized by Fannie Mae and Freddie Mac from 2009 to 2015.
Which R packages does this course teach for handling large datasets?
The course focuses on the bigmemory package for working with data larger than RAM and the iotools package for chunk-wise processing of both numeric and string data.
Why can't I just use standard R functions for very large datasets?
R stores all variables in memory by default, so datasets larger than available RAM cannot be handled with base R. This course teaches disk-based and chunk-wise processing as alternatives.
Does the course cover the split-apply-combine approach for big data?
Yes, you will implement split-apply-combine using bigmemory, learning how to explore and analyze large datasets through table creation and grouped computations.
What prior R experience is recommended before taking this course?
You should have completed Introduction to R, Intermediate R, and Writing Efficient R Code. Familiarity with writing efficient code is especially important for this course.
Join over 19 million learners and start Scalable Data Processing in R today!
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.Grow your data skills with DataCamp for Mobile
Make progress on the go with our mobile courses and daily 5-minute coding challenges.