Modern Data Warehouses & Data Lakes
58m 9s
English
Paid
Course description
As a data engineer, you will regularly work with analytical platforms where companies store data in Data Lakes and Data Warehouses for building visualizations and creating machine learning models.
Modern data warehouses, such as AWS Redshift, Google BigQuery, and Snowflake, allow you to load data directly from files in a Data Lake. This integration makes working with warehouses flexible and convenient for analytical tasks.
Read more about the course
In this course you will learn:
- How to use Data Lakes, Data Warehouses, and BI tools in a unified system
- How to load data into Data Lakes and visualize it in reports
- How to build integrations in Google Cloud Platform and AWS
- How ETL/ELT architecture works and how to apply it in modern data warehouses
Basics of Data Warehouses and Data Lakes
- The role of data warehouses in analytical platforms
- How data is loaded into Data Warehouse through ETL/ELT
- What Data Lakes are and how to use them
- How to work with files directly in the data lake
Practice on GCP: Cloud Storage, BigQuery, and Data Studio
- Setting up Cloud Storage, creating a table in BigQuery
- Data visualization in Data Studio
- Understanding the general principles of cloud platforms
Practice on AWS: S3, Athena, Glue, and Quicksight
- Creating data integration through S3, Athena, and Quicksight
- Setting up Glue Data Catalog for data management
- Detailed setup and integration of Glue
Summary and bonus lesson: AWS Redshift Spectrum
- Course summary
- Additional module on working with Redshift Spectrum using the prepared Data Catalog from the AWS project
Required knowledge
- Basics of working with Data Warehouses (it is recommended to take the "Data Warehouses" course in the academy)
- Basic knowledge of AWS Athena and Redshift (for the block with Redshift Spectrum, a prepared Data Catalog from the AWS project is used)
This course will help you master modern approaches to building data storage and processing systems and learn how to effectively use the capabilities of Data Lakes and Data Warehouses for analytics.
Watch Online
Watch Online Modern Data Warehouses & Data Lakes
0:00
/ #1: Introduction
All Course Lessons (14)
| # | Lesson Title | Duration | Access |
|---|---|---|---|
| 1 | Introduction Demo | 02:14 | |
| 2 | Data Science Platform | 04:11 | |
| 3 | ETL & ELT Data Warehouse | 06:23 | |
| 4 | Data Lake & Data Warehouse integration | 03:30 | |
| 5 | GCP & AWS Piplines we build | 03:15 | |
| 6 | GCP hands on Cloud Storage & BigQuery | 08:36 | |
| 7 | GCP hands on create Data Studio dashboard | 07:34 | |
| 8 | GCP Recap & AWS goals | 02:13 | |
| 9 | AWS Setup & upload data to S3 | 02:13 | |
| 10 | Athena Data Lake manual table configuration | 03:49 | |
| 11 | Creating a Quicksight dashboard | 05:06 | |
| 12 | Athena configuration using AWS Glue data catalog | 03:30 | |
| 13 | Course recap | 02:37 | |
| 14 | BONUS Configure Redshift Spectrum table with S3 | 02:58 |
Unlock unlimited learning
Get instant access to all 13 lessons in this course, plus thousands of other premium courses. One subscription, unlimited knowledge.
Learn more about subscriptionComments
0 commentsSimilar courses

MongoDB Fundamentals
Sources: Andreas Kretz
Document-oriented databases are rapidly gaining popularity among NoSQL solutions. Working with JSON documents in MongoDB is convenient, flexible, and...
1 hour 23 minutes 19 seconds

Machine Learning A-Z : Become Kaggle Master
Sources: udemy
Want to become a good Data Scientist? Then this is a right course for you. This course has been designed by IIT professionals who have mastered in Mathematics and Data Science....
36 hours 23 minutes 54 seconds

Statistics for Data Science and Business Analysis
Sources: udemy
Is statistics a driving force in the industry you want to enter? Do you want to work as a Marketing Analyst, a Business Intelligence Analyst, a Data Analyst, or
4 hours 49 minutes 30 seconds

2022 Python for Machine Learning & Data Science Masterclass
Sources: udemy
Welcome to the most complete course on learning Data Science and Machine Learning on the internet! After teaching over 2 million students I've worked for over a
44 hours 5 minutes 31 seconds

The Data Science Course: Complete Data Science Bootcamp 2023
Sources: udemy
Data scientist is one of the best suited professions to thrive this century. It is digital, programming-oriented, and analytical. Therefore, it comes as no surp
31 hours 14 minutes 14 seconds
Want to join the conversation?
Sign in to comment