Data Engineering on GCP
1h 17m 33s
English
Paid
Course description
Google Cloud Platform (GCP) is one of the most popular cloud platforms in the world, providing an extensive set of tools and services for building, managing, and optimizing data pipelines. GCP enables efficient storage, processing, analysis, and visualization of data, helping data engineers create scalable and high-performance solutions.
Read more about the course
What You Will Learn in the Course
In this practical course, you will step by step create your own project on GCP:
- Extract data from an external weather API
- Process it through a pipeline using GCP cloud services
- Store the data in a server database
- Create visualizations using Looker Studio
The course will help you master GCP from scratch, and the skills you acquire will also be useful when working with other cloud platforms such as AWS, as many of the services are quite similar.
In addition to the course, you will have access to a GitHub repository with a project overview and ready-made code snippets to help you quickly replicate the training examples.
Course Structure
- Project Data and Goals
- We will analyze the pipeline architecture, define the project goals, and get acquainted with the API for obtaining weather data. You will also learn how to set up an account in Google Cloud and activate the necessary services (by the way, Google provides $300 for free platform testing!).
- Project Preparation
- Create a project in Google Cloud, activate APIs, and set up schedules for task automation.
- Pipeline Creation: Extracting Data from API
- Set up the necessary resources for pipeline operation:
- A server database MySQL via Cloud SQL
- A virtual machine based on Linux via Compute Engine for database management
- Cloud Scheduler for scheduling API calls
- Server functions for data processing
- Pub/Sub message queue for data transfer between services
- Writing Data to the Database
- Learn to write server functions to store data in MySQL, test the writing process, and ensure that data is stored correctly.
- Data Visualization
- Set up Looker Studio to create clear visualizations: build bubble charts, time series, and organize weather data monitoring.
This course will give you practical experience with Google Cloud Platform tools and help develop key skills for working as a data engineer.
Watch Online
Join premium to watch
Go to premium
# | Title | Duration |
---|---|---|
1 | Introduction | 01:14 |
2 | GitHub & the team | 01:31 |
3 | Architecture of this project | 03:20 |
4 | Introduction Weather API | 02:19 |
5 | Setup Google Cloud Account | 02:13 |
6 | Creating the project | 02:36 |
7 | Enabling the required APIs | 01:35 |
8 | Configure scheduling | 02:21 |
9 | Setup VM for database interaction | 02:54 |
10 | Setup mysql database | 02:17 |
11 | Setup vm client and create database | 02:47 |
12 | Creating pub/sub message queue | 01:42 |
13 | Create cloud function to pull data form API | 04:18 |
14 | Explanation code pull from API | 04:21 |
15 | Create function to write to db | 07:48 |
16 | Explanation code write data to db | 05:57 |
17 | Testing the function | 05:52 |
18 | Create function write data to db - pull | 03:54 |
19 | Explanation code write data to db - pull | 04:34 |
20 | Setup Looker Studio and create bubble chart | 02:21 |
21 | Setup Looker Studio and create time series chart | 01:58 |
22 | Pipeline Monitoring | 06:21 |
23 | Conclusion & Challenges | 03:20 |
Comments
0 commentsSimilar courses

Deep Learning: Advanced Computer Vision
Sources: udemy
This is one of the most exciting courses I’ve done and it really shows how fast and how far deep learning has come over the years. When I first started my deep
15 hours 10 minutes 54 seconds

DS4B 101-P: Python for Data Science Automation
Sources: Business Science University
Python for Data Science Automation is an innovative course designed to teach data analysts how to convert business processes to python-based data science automations. The course...
27 hours 6 minutes 1 second

Relational Data Modeling
Sources: Eka Ponkratova
Relational modeling is widely used in building transactional databases. You might say, "But I'm not planning to become a backend engineer."
1 hour 52 minutes

Choosing Data Stores
Sources: Andreas Kretz
One of the key tasks when creating a data platform and pipelines is the selection of appropriate data storage systems. This course is dedicated to that topic.
1 hour 25 minutes 31 seconds

Snowflake for Data Engineers
Sources: Andreas Kretz
Snowflake is a next-generation cloud data warehouse that everyone is talking about today. The platform operates 100% in the cloud, providing flexible access...
2 hours 4 minutes 8 seconds
Want to join the conversation?
Sign in to comment