Scalable Data Processing in R
Learn how to write scalable code for working with big data in R using the bigmemory and iotools packages.
Course Description
Data sets are often larger than available RAM, which causes problems for R programmers, since by default all variables are stored in memory. You’ll learn tools for processing, exploring, and analyzing data directly from disk. You’ll also implement the split-apply-combine approach and learn how to write scalable code using the bigmemory and iotools packages. In this course, you’ll make use of the Federal Housing Finance Agency’s data, a publicly available data set chronicling all mortgages held or securitized by the Federal National Mortgage Association (Fannie Mae) and the Federal Home Loan Mortgage Corporation (Freddie Mac) from 2009 to 2015.
What You’ll Learn
Working with increasingly large data sets
In this chapter, we cover the reasons you need new techniques when data sets are larger than available RAM. We show that importing and exporting data with base R functions can be slow, and cover some easy ways to remedy this. Finally, we introduce the bigmemory package.
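As a taste of what bigmemory offers, here is a minimal sketch (the file name "mortgages.csv" and its contents are placeholders, not the course's exact code) of importing a CSV into a file-backed big.matrix and re-attaching it in a later session:

library(bigmemory)

# Import the CSV once; the data live in a binary backing file on disk
# rather than in RAM. A big.matrix holds a single type, hence type = "integer".
mortgages <- read.big.matrix("mortgages.csv", header = TRUE, type = "integer",
                             backingfile = "mortgages.bin",
                             descriptorfile = "mortgages.desc")

# In later sessions, re-attach the on-disk data almost instantly.
mortgages <- attach.big.matrix("mortgages.desc")

dim(mortgages)   # dimensions without loading the full data set into memory
head(mortgages)  # peek at the first few rows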
Working with iotools
We’ll use the iotools package, which can process both numeric and string data, and introduce the concept of chunk-wise processing.
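To illustrate the idea, here is a rough sketch of chunk-wise processing with iotools' chunk.apply(), assuming a headerless comma-separated file ("mortgages.csv") whose first column holds a loan amount; the file and column are placeholders for illustration:

library(iotools)

chunk_summaries <- chunk.apply("mortgages.csv",
  function(chunk) {
    # Parse the raw chunk of bytes into a numeric matrix.
    m <- mstrsplit(chunk, sep = ",", type = "numeric")
    # Summarise this chunk only.
    c(total = sum(m[, 1]), rows = nrow(m))
  },
  CH.MERGE = rbind)

# Combine the per-chunk summaries into overall figures.
colSums(chunk_summaries)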
Processing and Analyzing Data with bigmemory
Now that you’ve got some experience using bigmemory, we’re going to go through some simple data exploration and analysis techniques. In particular, we’ll see how to create tables and implement the split-apply-combine approach.
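For a flavour of what this looks like, the sketch below (the column names "year" and "amount" are placeholders) tabulates a big.matrix with the companion bigtabulate package and computes a split-apply-combine summary with base R:

library(bigmemory)
library(bigtabulate)

mortgages <- attach.big.matrix("mortgages.desc")

# Count loans per year without copying the whole matrix into RAM.
bigtable(mortgages, "year")

# Split-apply-combine: split row indices by year, apply a summary to each
# group, and combine the results into a single named vector.
by_year <- split(seq_len(nrow(mortgages)), mortgages[, "year"])
mean_amount <- Map(function(rows) mean(mortgages[rows, "amount"]), by_year)
unlist(mean_amount)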
Case Study: A Preliminary Analysis of the Housing Data
In the previous chapters, we’ve introduced the housing data and shown how to compute with data that is about as big as, or bigger than, the amount of RAM on a single machine. In this chapter, we’ll go through a preliminary analysis of the data, comparing various trends over time.
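The sketch below shows the shape of such a comparison, assuming the file-backed matrix from the earlier sketches with a placeholder "year" column; a single column comfortably fits in RAM, so it can be pulled out and handled with base R:

library(bigmemory)

mortgages <- attach.big.matrix("mortgages.desc")

# Extract one column, count mortgages per year, and plot the trend.
years <- mortgages[, "year"]
counts <- table(years)
plot(as.integer(names(counts)), as.integer(counts), type = "b",
     xlab = "Year", ylab = "Number of mortgages")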