A Big Data Hadoop and Spark project for absolute beginners

- 85%

Certificate	Paid
Language	English
Level	Beginner

Last updated on March 30, 2025 7:12 am

A Big Data Hadoop and Spark project for absolute beginners

udemy.com

Category: IT certifications

Learn Big Data, Hadoop, and Spark from scratch by solving a real-world use case using Python and Scala. Prepare for a Data Engineer role and gain hands-on experience with Hadoop, Hive, and Spark. Cleanse and analyze large volumes of data using Big Data technology. Ideal for beginners and experienced professionals transitioning to a Big Data role.

Add your review

Description
Reviews (0)
Report

What you’ll learn

Big Data , Hadoop and Spark from scratch by solving a real world use case using Python and Scala
Spark Scala & PySpark real world coding framework.
Real world coding best practices, logging, error handling , configuration management using both Scala and Python.
Serverless big data solution using AWS Glue, Athena and S3

This course will prepare you for a real world Data Engineer role !

Get started with Big Data quickly leveraging free cloud cluster and solving a real world use case! Learn Hadoop, Hive , Spark (both Python and Scala) from scratch!

Learn to code Spark Scala & PySpark like a real world developer. Understand real world coding best practices, logging, error handling , configuration management using both Scala and Python.

Project

A bank is launching a new credit card and wants to identify prospects it can target in its marketing campaign.

It has received prospect data from various internal and 3rd party sources. The data has various issues such as missing or unknown values in certain fields. The data needs to be cleansed before any kind of analysis can be done.

Since the data is in huge volume with billions of records, the bank has asked you to use Big Data Hadoop and Spark technology to cleanse, transform and analyze this data.

What you will learn :

Big Data, Hadoop concepts
How to create a free Hadoop and Spark cluster using Google Dataproc
Hadoop hands-on – HDFS, Hive
Python basics
PySpark RDD – hands-on
PySpark SQL, DataFrame – hands-on
Project work using PySpark and Hive
Scala basics
Spark Scala DataFrame
Project work using Spark Scala
Spark Scala Real world coding framework and development using Winutil, Maven and IntelliJ.
Python Spark Hadoop Hive coding framework and development using PyCharm
Building a data pipeline using Hive , PostgreSQL, Spark
Logging , error handling and unit testing of PySpark and Spark Scala applications
Leveraging ChatGPT for faster development (An example)
Creating a Databricks Community Edition account to practice Spark
Spark Scala Structured Streaming
Applying spark transformation on data stored in AWS S3 using Glue and viewing data using Athena

Prerequisites :

Some basic programming skills
Some knowledge of SQL queries

Who this course is for:

Beginners who want to learn Big Data or experienced people who want to transition to a Big Data role
Big data beginners who want to learn how to code in the real world

User Reviews

0.0 out of 5

★★★★★

Write a review

There are no reviews yet.

Be the first to review “A Big Data Hadoop and Spark project for absolute beginners” Cancel reply

You must be logged in to post a review.

Report this page

A Big Data Hadoop and Spark project for absolute beginners

Description
Reviews (0)
Report

Go to Class

A Big Data Hadoop and Spark project for absolute beginners

What you’ll learn

Who this course is for:

User Reviews

Be the first to review “A Big Data Hadoop and Spark project for absolute beginners” Cancel reply

SAP Enable Now C_SEN2205 Training for Certification

Scrum Master 2 Certification: 6 Practice Tests

Exam SC-100 Cybersecurity Architect Practice Test -JAN 2023

A Big Data Hadoop and Spark project for absolute beginners

What you’ll learn

Who this course is for:

User Reviews

Be the first to review “A Big Data Hadoop and Spark project for absolute beginners” Cancel reply

Related Products

SAP Enable Now C_SEN2205 Training for Certification

Scrum Master 2 Certification: 6 Practice Tests

Exam SC-100 Cybersecurity Architect Practice Test -JAN 2023