Learning Apache Spark

Name: Learning Apache Spark
Price: 9 USD
Availability: InStock

1h 44m 4s

English

Paid

June 24, 2025

Course description

After building data pipelines, data processing is one of the most important tasks in Data Engineering. As a data engineer, you constantly face the need for processing and it's crucial to be able to configure a powerful and distributed processing system. One of the most useful and widely used tools for this is Apache Spark.

Watch Online

0:00

#1: Introduction & Contents

All Course Lessons (21)

#	Lesson Title	Duration
1	Introduction & Contents Demo	03:31
2	Why Spark - Vertical vs Horizontal Scaling	03:56
3	What Spark Is Good For	04:46
4	Spark Driver, Context & Executors	04:12
5	Cluster Types	02:00
6	Client vs Cluster Deployment	06:12
7	Where to Run Spark	03:39
8	Tools in the Spark Course	02:36
9	The Dataset	04:13
10	Docker Setup	02:53
11	Jupyter Notebook Setup & Run	05:32
12	RDDs	03:58
13	DataFrames	01:41
14	Transformations & Actions Overview	03:00
15	Transformations	02:23
16	Actions	03:07
17	Notebook 1: JSON Transformations	09:53
18	Notebook 2: Working with Schemas	08:24
19	Notebook 3: Working With DataFrames	10:10
20	Notebook 4: SparkSQL	05:05
21	Notebook 5: Working with RDDs	12:53

Unlock unlimited learning

Get instant access to all 20 lessons in this course, plus thousands of other premium courses. One subscription, unlimited knowledge.

Learn more about subscription

Comments

0 comments

Want to join the conversation?

Similar courses

Python for Data Engineers

Sources: Andreas Kretz

If you want to take your skills in Data Engineering to the next level - you are in the right place. Python has become the primary language for data analysis...

2 hours 21 minutes 18 seconds

Mathematical Foundations of Machine Learning

Sources: udemy

Mathematics forms the core of data science and machine learning. Thus, to be the best data scientist you can be, you must have a working understanding of the mo

16 hours 25 minutes 26 seconds

Streaming with Kafka & Spark

Sources: Andreas Kretz

This course is a comprehensive project with a full cycle of real-time data processing. You will work with data from an online store, including invoices...

2 hours 46 minutes 25 seconds

Statistics for Data Science and Business Analysis

Sources: udemy

Is statistics a driving force in the industry you want to enter? Do you want to work as a Marketing Analyst, a Business Intelligence Analyst, a Data Analyst, or

4 hours 49 minutes 30 seconds

Relational Data Modeling

Sources: Eka Ponkratova

Relational modeling is widely used in building transactional databases. You might say, "But I'm not planning to become a backend engineer."

1 hour 52 minutes