dbt for Data Engineers

1h 52m 55s
English
Paid

dbt (data build tool) is a data transformation tool that prioritizes SQL. It allows for simple and transparent transformation, testing, and documentation of data directly within the warehouse. Thanks to dbt, teams can create reliable datasets for analytics, machine learning, and business processes without the need to export data externally. This is why dbt is becoming a key tool for data engineers, and this course is the perfect starting point for mastering it.

Read more about the course

Introduction to dbt

Before the practice, you will learn:

  • The difference between ETL and ELT,
  • The challenges faced by modern pipelines,
  • How dbt Core and dbt Cloud differ and their key advantages.

Setup: Snowflake, dbt Core, and GitHub

For the practice, you will:

  • Create a repository on GitHub,
  • Create an account in dbt Cloud and set up a data warehouse in Snowflake,
  • Perform basic configuration of the project in dbt and define the model structure (SQL or Python file).

Building Data Pipelines in dbt

You will create a chain of models (pipelines) using an e-commerce dataset. You will use dbt Core, dbt Cloud, and Snowflake for step-by-step data transformation.

Materializations in dbt

After building the models, you will learn how to save transformation results:

  • In tables,
  • Views,
  • Incremental or ephemeral models.

You will also learn how external and internal dbt sources work and the dependencies between them.

Testing dbt Models

You will learn how to test models - a key part of reliable data work:

  • Generic and bespoke tests,
  • Quality and consistency checks of data at all pipeline stages.

Deployment and Scheduling Models

Now that models are working locally, you will learn how to:

  • Share them with the team,
  • Run them on a schedule,
  • Update models automatically.

You will explore practices for deployment and scheduling in dbt Cloud.

Advanced dbt Features

At the end of the course:

  • Set up CI/CD processes directly in dbt Cloud,
  • Generate complete project documentation and understand how to use it within a team,
  • Learn about best practices for working with dbt in production.

What the Course Includes

  • Source code repository (GitHub)
  • E-commerce dataset
  • Step-by-step video tutorials
  • A selection of useful links and additional materials

Requirements

  • Basic knowledge of relational databases
  • Ability to work with SQL
  • Recommended: basic experience with Git and cloud platforms (Snowflake, dbt Cloud)

Watch Online dbt for Data Engineers

Join premium to watch
Go to premium
# Title Duration
1 Introduction 02:24
2 Modern data experience 05:43
3 Introduction to dbt 04:39
4 Goals of this course 04:51
5 Snowflake preparation 07:30
6 Loading data into Snowflake 09:36
7 Setup dbt Core 03:33
8 Preparing the GitHub repository 06:17
9 dbt models & materialization explained 05:49
10 Creating your first sql model 05:29
11 Working with custom schemas 04:36
12 Creating your first python model 01:56
13 dbt sources 04:04
14 Configuring sources 04:21
15 Working with seed files 03:20
16 Generic tests 03:26
17 Tests with Great Expectations 02:50
18 Writing custom generic tests 07:26
19 dbt cloud setup 05:15
20 creating dbt jobs 10:53
21 CI/CD automation with dbt cloud and GitHub 07:39
22 Documenation in dbt 01:18
23 Conclusion 00:00

Similar courses to dbt for Data Engineers

Statistics for Data Science and Business Analysis

Statistics for Data Science and Business Analysisudemy

Category: Data processing and analysis
Duration 4 hours 49 minutes 30 seconds
The Data Science Course: Complete Data Science Bootcamp 2023

The Data Science Course: Complete Data Science Bootcamp 2023udemy

Category: Data processing and analysis
Duration 31 hours 14 minutes 14 seconds
Machine Learning Design Questions

Machine Learning Design Questionsalgoexpert

Category: Data processing and analysis
Duration 3 hours 3 minutes 57 seconds
Dockerized ETL With AWS, TDengine & Grafana

Dockerized ETL With AWS, TDengine & GrafanaAndreas Kretz

Category: Data processing and analysis
Duration 29 minutes 12 seconds
TensorFlow Developer Certificate in 2023: Zero to Mastery

TensorFlow Developer Certificate in 2023: Zero to Masteryzerotomastery.io

Category: Data processing and analysis
Duration 62 hours 43 minutes 54 seconds
Data Engineering on Azure

Data Engineering on AzureKristijan Bakarić

Category: Azure, Data processing and analysis
Duration 1 hour 20 minutes 57 seconds
Data Analysis for Beginners: Excel & Pivot Tables

Data Analysis for Beginners: Excel & Pivot Tableszerotomastery.io

Category: Data processing and analysis
Duration 2 hours 10 minutes 21 seconds
Data Engineering on GCP

Data Engineering on GCPAndreas Kretz

Category: Data processing and analysis
Duration 1 hour 17 minutes 33 seconds
Relational Data Modeling

Relational Data ModelingEka Ponkratova

Category: Data processing and analysis
Duration 1 hour 52 minutes
Dimensional Data Modeling

Dimensional Data ModelingEka Ponkratova

Category: Data processing and analysis
Duration 1 hour 37 minutes 57 seconds