Feature Engineering with PySpark

Language	English
Level	Beginner
Access	Paid
Certificate	Paid

Datacamp

Category: Data Science

Learn the gritty details that data scientists are spending 70-80% of their time on; data wrangling and feature engineering.

Add your review

Description
Reviews (0)
Report

Course Description

The real world is messy and your job is to make sense of it. Toy datasets like MTCars and Iris are the result of careful curation and cleaning, even so the data needs to be transformed for it to be useful for powerful machine learning algorithms to extract meaning, forecast, classify or cluster. This course will cover the gritty details that data scientists are spending 70-80% of their time on

data wrangling and feature engineering. With size of datasets now becoming ever larger, let’s use PySpark to cut this Big Data problem down to size!

What You’ll Learn

Exploratory Data Analysis

Get to know a bit about your problem before you dive in! Then learn how to statistically and visually inspect your dataset!

Feature Engineering

In this chapter learn how to create new features for your machine learning model to learn from. We’ll look at generating them by combining fields, extracting values from messy columns or encoding them for better results.

Wrangling with Spark Functions

Real data is rarely clean and ready for analysis. In this chapter learn to remove unneeded information, handle missing values and add additional data to your analysis.

Building a Model

In this chapter we’ll learn how to choose which type of model we want. Then we will learn how to apply our data to the model and evaluate it. Lastly, we’ll learn how to interpret the results and save the model for later!

User Reviews

0.0 out of 5

★★★★★

Write a review

There are no reviews yet.

Be the first to review “Feature Engineering with PySpark” Cancel reply

You must be logged in to post a review.

Report this page

Feature Engineering with PySpark

Description
Reviews (0)
Report

Visit Course

Feature Engineering with PySpark

Course Description

What You’ll Learn

Exploratory Data Analysis

Feature Engineering

Wrangling with Spark Functions

Building a Model

User Reviews

Be the first to review “Feature Engineering with PySpark” Cancel reply

NoSQL, Big Data, and Spark Foundations Specialization

Big Data Analytical Platform on Alibaba Cloud

Getting Started with Data Analytics on AWS

Feature Engineering with PySpark

Course Description

What You’ll Learn

Exploratory Data Analysis

Feature Engineering

Wrangling with Spark Functions

Building a Model

User Reviews

Be the first to review “Feature Engineering with PySpark” Cancel reply

Related Products

NoSQL, Big Data, and Spark Foundations Specialization

Big Data Analytical Platform on Alibaba Cloud

Getting Started with Data Analytics on AWS