Skip to main content
CourseFlix

Apache Kafka Fundamentals

1h 4m 52s
English
Paid

Master the fundamentals of Apache Kafka in this comprehensive course designed to provide you with essential knowledge for a confident start. You will learn to configure a message queue, write producers and consumers, and understand Kafka's role within data and event processing architectures.

Understanding Kafka and Message Queues

Discover what Kafka is and its usage in stream and event processing systems. You will gain insights into the key components of Kafka, including topics, messages, and consumer groups. Learn how these components interact, how data is written and read from a message queue, and the significance of message order and delivery guarantees.

Exploring Apache Kafka Architecture

Dive deep into Kafka's architecture. Understand topic partitions and their relation to brokers. You will explore data processing within Kafka and learn about Zookeeper, its roles, and its interactions with Kafka brokers and metadata.

Setting Up Your Kafka Development Environment

Learn to run Kafka on a Windows environment using Docker. This section includes a step-by-step guide on setting up a Bitnami Kafka Docker container, complete with practical tips for successful installation and environment launch.

Hands-On Practice with Kafka

Set up your own Kafka topic and master the basic commands to manage it. You will create a producer to write messages and a consumer to read them. Test their functionality using Python and manage consumer offsets utilizing the offset checker.

Integrating Kafka into Data Processing Platforms

Conclude the course by exploring Kafka's integration into Data Science platforms. Examine three practical scenarios of using Kafka:

  • ETL ingest pipeline
  • Multiple consumer processes
  • Multistage stream processing

These examples will equip you with the knowledge to implement Kafka in your everyday work effectively.

Additional

https://hub.docker.com/r/bitnami/kafka

https://github.com/team-data-science/apache-kafka

About the Author: Andreas Kretz

Andreas Kretz thumbnail

Andreas Kretz is a German data engineer and one of the most widely followed independent voices on data engineering as a career discipline. He runs the Plumbers of Data Science brand and has been publishing tutorial material continuously since the field consolidated around the modern lake-house stack (Spark, Kafka, Snowflake, Databricks, Airflow).

His CourseFlix listing is the largest single-author catalog under this source — over thirty courses spanning data-pipeline construction, streaming architectures, the cloud-native data stack on AWS / Azure / GCP, the Python and Scala tooling that dominates the field, and the soft-skills / career side of breaking into data engineering. Material is paid and aimed at engineers transitioning into data work or already-working data engineers picking up specific tools.

Watch Online 15 lessons

This is a demo lesson (10:00 remaining)

You can watch up to 10 minutes for free. Subscribe to unlock all 15 lessons in this course and access 10,000+ hours of premium content across all courses.

View Pricing
0:00
/
#1: Introduction
All Course Lessons (15)
#Lesson TitleDurationAccess
1
Introduction Demo
02:16
2
What is Kafka
09:15
3
Basic Kafka Parts
04:25
4
Message Queue Basics
07:39
5
Topics Partitions & Brokers
02:16
6
Brokers & Zookeeper
04:40
7
Development Environment
02:42
8
Bitnami Docker Setup
03:34
9
Basic Topic Commands
04:16
10
Kafka Producer
05:56
11
Kafka Consumer
01:35
12
Testing Producer & Consumer
02:45
13
Working with Consumer Offsets
06:46
14
Examples How Kafka Fits in Data Platforms
05:25
15
Conclusion
01:22
Unlock unlimited learning

Get instant access to all 14 lessons in this course, plus thousands of other premium courses. One subscription, unlimited knowledge.

Learn more about subscription

Related courses

  • Data Structures and Algorithmic Trading: Machine Learning thumbnail

    Data Structures and Algorithmic Trading: Machine Learning

    Sources: Udemy
    Data Structures and Algorithmic trading is a method of executing orders using automated pre-programmed trading instructions over time. They were developed so th
    2 hours 20 minutes 32 seconds 5 / 5
  • The Data Bootcamp: Transform your Data using dbt™ thumbnail

    The Data Bootcamp: Transform your Data using dbt™

    Sources: Udemy
    Are you looking for a cutting-edge way to extract load and transform your data? Do you want to know more about dbt™ and how to use it? Well, this is the course
    4 hours 10 minutes 51 seconds 5 / 5
  • Data Engineering on Databricks thumbnail

    Data Engineering on Databricks

    Sources: Andreas Kretz
    Learn Databricks for data processing using Apache Spark. This course covers setup on AWS, ETL processes, data visualization, and BI tools integration.
    1 hour 27 minutes 29 seconds 5 / 5