dbt for Data Engineers

1h 52m 55s
English
Paid

Course description

dbt (data build tool) is a data transformation tool that prioritizes SQL. It allows for simple and transparent transformation, testing, and documentation of data directly within the warehouse. Thanks to dbt, teams can create reliable datasets for analytics, machine learning, and business processes without the need to export data externally. This is why dbt is becoming a key tool for data engineers, and this course is the perfect starting point for mastering it.

Read more about the course

Introduction to dbt

Before the practice, you will learn:

  • The difference between ETL and ELT,
  • The challenges faced by modern pipelines,
  • How dbt Core and dbt Cloud differ and their key advantages.

Setup: Snowflake, dbt Core, and GitHub

For the practice, you will:

  • Create a repository on GitHub,
  • Create an account in dbt Cloud and set up a data warehouse in Snowflake,
  • Perform basic configuration of the project in dbt and define the model structure (SQL or Python file).

Building Data Pipelines in dbt

You will create a chain of models (pipelines) using an e-commerce dataset. You will use dbt Core, dbt Cloud, and Snowflake for step-by-step data transformation.

Materializations in dbt

After building the models, you will learn how to save transformation results:

  • In tables,
  • Views,
  • Incremental or ephemeral models.

You will also learn how external and internal dbt sources work and the dependencies between them.

Testing dbt Models

You will learn how to test models - a key part of reliable data work:

  • Generic and bespoke tests,
  • Quality and consistency checks of data at all pipeline stages.

Deployment and Scheduling Models

Now that models are working locally, you will learn how to:

  • Share them with the team,
  • Run them on a schedule,
  • Update models automatically.

You will explore practices for deployment and scheduling in dbt Cloud.

Advanced dbt Features

At the end of the course:

  • Set up CI/CD processes directly in dbt Cloud,
  • Generate complete project documentation and understand how to use it within a team,
  • Learn about best practices for working with dbt in production.

What the Course Includes

  • Source code repository (GitHub)
  • E-commerce dataset
  • Step-by-step video tutorials
  • A selection of useful links and additional materials

Requirements

  • Basic knowledge of relational databases
  • Ability to work with SQL
  • Recommended: basic experience with Git and cloud platforms (Snowflake, dbt Cloud)

Watch Online

This is a demo lesson (10:00 remaining)

You can watch up to 10 minutes for free. Subscribe to unlock all 23 lessons in this course and access 10,000+ hours of premium content across all courses.

View Pricing

Watch Online dbt for Data Engineers

0:00
/
#1: Introduction

All Course Lessons (23)

#Lesson TitleDurationAccess
1
Introduction Demo
02:24
2
Modern data experience
05:43
3
Introduction to dbt
04:39
4
Goals of this course
04:51
5
Snowflake preparation
07:30
6
Loading data into Snowflake
09:36
7
Setup dbt Core
03:33
8
Preparing the GitHub repository
06:17
9
dbt models & materialization explained
05:49
10
Creating your first sql model
05:29
11
Working with custom schemas
04:36
12
Creating your first python model
01:56
13
dbt sources
04:04
14
Configuring sources
04:21
15
Working with seed files
03:20
16
Generic tests
03:26
17
Tests with Great Expectations
02:50
18
Writing custom generic tests
07:26
19
dbt cloud setup
05:15
20
creating dbt jobs
10:53
21
CI/CD automation with dbt cloud and GitHub
07:39
22
Documenation in dbt
01:18
23
Conclusion
00:00

Unlock unlimited learning

Get instant access to all 22 lessons in this course, plus thousands of other premium courses. One subscription, unlimited knowledge.

Learn more about subscription

Comments

0 comments

Want to join the conversation?

Sign in to comment

Similar courses

Machine Learning A-Z : Become Kaggle Master

Machine Learning A-Z : Become Kaggle Master

Sources: udemy
Want to become a good Data Scientist? Then this is a right course for you. This course has been designed by IIT professionals who have mastered in Mathematics and Data Science....
36 hours 23 minutes 54 seconds
Streaming with Kafka & Spark

Streaming with Kafka & Spark

Sources: Andreas Kretz
This course is a comprehensive project with a full cycle of real-time data processing. You will work with data from an online store, including invoices...
2 hours 46 minutes 25 seconds
The Data Engineering Bootcamp: Zero to Mastery

The Data Engineering Bootcamp: Zero to Mastery

Sources: zerotomastery.io
Learn to build streaming pipelines with Apache Kafka and Flink, create data lakes on AWS, run ML workflows on Spark, and integrate LLM models into...
13 hours 23 minutes 15 seconds
Data Analysis with Pandas and Python

Data Analysis with Pandas and Python

Sources: udemy
Welcome to the most comprehensive Pandas course available on Udemy! An excellent choice for both beginners and experts looking to expand their knowledge on one of the most popul...
19 hours 5 minutes 40 seconds
Data Analysis for Beginners: Python & Statistics

Data Analysis for Beginners: Python & Statistics

Sources: zerotomastery.io
This course is your first step into the world of data analysis using one of the main tools for analysts - Python. Without complicated terms, advanced...
6 hours 34 minutes 20 seconds