Skip to main content

Modern Data Warehouses & Data Lakes

58m 9s
English
Paid

Course description

As a data engineer, you will regularly work with analytical platforms where companies store data in Data Lakes and Data Warehouses for building visualizations and creating machine learning models. Modern data warehouses, such as AWS Redshift, Google BigQuery, and Snowflake, allow you to load data directly from files in a Data Lake. This integration makes working with warehouses flexible and convenient for analytical tasks.
Read more about the course

In this course you will learn:

  • How to use Data Lakes, Data Warehouses, and BI tools in a unified system
  • How to load data into Data Lakes and visualize it in reports
  • How to build integrations in Google Cloud Platform and AWS
  • How ETL/ELT architecture works and how to apply it in modern data warehouses

Basics of Data Warehouses and Data Lakes

  • The role of data warehouses in analytical platforms
  • How data is loaded into Data Warehouse through ETL/ELT
  • What Data Lakes are and how to use them
  • How to work with files directly in the data lake

Practice on GCP: Cloud Storage, BigQuery, and Data Studio

  • Setting up Cloud Storage, creating a table in BigQuery
  • Data visualization in Data Studio
  • Understanding the general principles of cloud platforms

Practice on AWS: S3, Athena, Glue, and Quicksight

  • Creating data integration through S3, Athena, and Quicksight
  • Setting up Glue Data Catalog for data management
  • Detailed setup and integration of Glue

Summary and bonus lesson: AWS Redshift Spectrum

  • Course summary
  • Additional module on working with Redshift Spectrum using the prepared Data Catalog from the AWS project

Required knowledge

  • Basics of working with Data Warehouses (it is recommended to take the "Data Warehouses" course in the academy)
  • Basic knowledge of AWS Athena and Redshift (for the block with Redshift Spectrum, a prepared Data Catalog from the AWS project is used)

This course will help you master modern approaches to building data storage and processing systems and learn how to effectively use the capabilities of Data Lakes and Data Warehouses for analytics.

Watch Online

This is a demo lesson (10:00 remaining)

You can watch up to 10 minutes for free. Subscribe to unlock all 14 lessons in this course and access 10,000+ hours of premium content across all courses.

View Pricing
0:00
/
#1: Introduction

All Course Lessons (14)

#Lesson TitleDurationAccess
1
Introduction Demo
02:14
2
Data Science Platform
04:11
3
ETL & ELT Data Warehouse
06:23
4
Data Lake & Data Warehouse integration
03:30
5
GCP & AWS Piplines we build
03:15
6
GCP hands on Cloud Storage & BigQuery
08:36
7
GCP hands on create Data Studio dashboard
07:34
8
GCP Recap & AWS goals
02:13
9
AWS Setup & upload data to S3
02:13
10
Athena Data Lake manual table configuration
03:49
11
Creating a Quicksight dashboard
05:06
12
Athena configuration using AWS Glue data catalog
03:30
13
Course recap
02:37
14
BONUS Configure Redshift Spectrum table with S3
02:58

Unlock unlimited learning

Get instant access to all 13 lessons in this course, plus thousands of other premium courses. One subscription, unlimited knowledge.

Learn more about subscription

Comments

0 comments

Want to join the conversation?

Sign in to comment

Similar courses

The Data Science Course: Complete Data Science Bootcamp 2023

The Data Science Course: Complete Data Science Bootcamp 2023

Sources: udemy
Data scientist is one of the best suited professions to thrive this century. It is digital, programming-oriented, and analytical. Therefore, it comes as no surp
31 hours 14 minutes 14 seconds
PyTorch for Deep Learning with Python Bootcamp

PyTorch for Deep Learning with Python Bootcamp

Sources: udemy
Welcome to the best online course for learning about Deep Learning with Python and PyTorch! PyTorch is an open source deep learning platform that provides a sea
17 hours 2 minutes 14 seconds
Machine Learning & Containers on AWS

Machine Learning & Containers on AWS

Sources: Andreas Kretz
In this practical course, you will learn how to build a complete data pipeline on the AWS platform - from obtaining data from the Twitter API to analysis, stora
1 hour 33 minutes 34 seconds
Data Analysis for Beginners: Excel & Pivot Tables

Data Analysis for Beginners: Excel & Pivot Tables

Sources: zerotomastery.io
This short course on data analysis in Excel is perfect for beginners who want to acquire skills in analyzing structured data using two of Excel's most...
2 hours 10 minutes 21 seconds
SQL & Database Design A-Z™: Learn MS SQL Server + PostgreSQL

SQL & Database Design A-Z™: Learn MS SQL Server + PostgreSQL

Sources: udemy
Are you interested in a career in Data Science or Data Analytics? In that case, inevitably you are going to encounter databases in your work. But how do you int
12 hours 32 minutes 7 seconds