Master Apache Spark with this in-depth course designed for data engineers seeking to enhance their data processing capabilities. Apache Spark is an essential tool for large-scale distributed data processing, and proficiency in it can significantly streamline your workflows and improve data pipeline efficiency.
Course Overview
In this course, you will explore the architecture and fundamental principles of Apache Spark. You will engage in hands-on activities with transformations and actions in Spark, using Jupyter Notebooks in a Docker environment. The course covers DataFrames, Spark SQL, and RDDs to equip you with skills for processing both structured and unstructured data. By the end, you will be able to write your own Spark jobs and build data pipelines.
Basics of Spark
This section will cover the benefits of Spark, emphasizing the distinction between vertical and horizontal scaling. You will learn about the various data types Spark can handle and the environments it can be deployed in. A deep dive into the architecture will cover its components, including the executors, the driver, the SparkContext, and the cluster manager, alongside the different cluster types and the nuances between client and cluster deploy modes.
Data and Development Environment
Here, you will be introduced to the tools and setup required for this course. You will discover the chosen dataset and learn to install and configure your development environment, including Docker and Jupyter Notebook.
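One common way to get such an environment is the community-maintained `jupyter/pyspark-notebook` Docker image, which bundles Spark, PySpark, and Jupyter. The exact image and port mapping below are an assumption for illustration, not necessarily the course's prescribed setup:

```shell
# Pull the community Jupyter + PySpark image (assumed here; the course
# may use a different image or a custom Dockerfile).
docker pull jupyter/pyspark-notebook

# Expose the Jupyter server on port 8888 and mount the current
# directory so notebooks persist outside the container.
docker run -it --rm \
  -p 8888:8888 \
  -v "$PWD":/home/jovyan/work \
  jupyter/pyspark-notebook
```

The container prints a URL with an access token; opening it in a browser gives you a Jupyter environment where `pyspark` is already importable.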
Spark Coding Basics
Before diving into practice, you will gain an understanding of core concepts such as RDDs and DataFrames, identifying their properties and applications with various data types. You will learn the distinction between transformations and actions, how each interacts with data, and become familiar with the most commonly used operations.
Practical Exercises in Jupyter Notebook
A comprehensive GitHub repository will provide you with all the course's code, facilitating an easy start to your practical journey.
You will work with five dedicated notebooks, acquiring the skills to:
- Apply transformations to datasets,
- Work with schemas, columns, and data types,
- Process JSON and CSV files using DataFrames,
- Merge and modify DataFrames effectively,
- Utilize Spark SQL for enhanced data operations,
- Apply RDDs to unstructured data scenarios.