Data Engineering on Azure

1h 20m 57s
English
Paid

Microsoft Azure is a cloud platform offering over 200 products and services for data storage, management, virtual machine deployment, and application development in the cloud. Azure supports working with various frameworks and tools, allowing applications to run in a multi-cloud environment, locally, or at the network edge.

What you will learn in the course

In this course, led by Kristian Bakarich, you will build a streaming data pipeline in Azure hands-on. Over the course of the project, you will learn to use key Azure services to process a stream of Twitter data in JSON format, including:

  • APIM (API Management) - for data intake,
  • Blob Storage - for storage,
  • Azure Functions - for processing,
  • Cosmos DB - for storing processed data,
  • Power BI - for data visualization.

Project Structure

  1. Introduction and Architecture
    1. Get acquainted with the overall solution architecture and key components of the pipeline.
  2. Data Creation and Sending
    1. Write a JSON file with messages, then create a Python script that sends the JSON objects to Azure API Management via HTTP requests.
  3. Development and Deployment of Azure Functions
    1. Learn to create and deploy Azure Functions in Python using Visual Studio Code, starting from a function project with basic logic.
  4. Service Integration
    1. Set up and integrate Event Hubs, Azure Functions, and Cosmos DB, and learn to write messages from Event Hubs to Cosmos DB.
  5. Data Visualization in Power BI
    1. Connect Power BI Desktop to Cosmos DB for real-time data visualization.
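
As an illustration of step 2 above, here is a minimal sketch of a sender script. It is not the course's own code: the endpoint URL, the credentials, and the helper names `build_request` and `send_messages` are hypothetical, and it assumes the API sits behind HTTP Basic authentication and that the JSON file holds a list of message objects. Only the Python standard library is used.

```python
import base64
import json
import urllib.request

# Hypothetical APIM endpoint; replace with the gateway URL of your own instance.
APIM_URL = "https://example-apim.azure-api.net/tweets"


def build_request(url, message, username, password):
    """Build a POST request carrying one JSON message with Basic auth."""
    body = json.dumps(message).encode("utf-8")
    credentials = f"{username}:{password}".encode("utf-8")
    token = base64.b64encode(credentials).decode("ascii")
    headers = {
        "Content-Type": "application/json",
        "Authorization": f"Basic {token}",
    }
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


def send_messages(path, url, username, password):
    """Read a JSON file containing a list of messages and send them one by one."""
    with open(path, encoding="utf-8") as f:
        messages = json.load(f)
    for message in messages:
        request = build_request(url, message, username, password)
        # urlopen raises urllib.error.HTTPError on 4xx/5xx responses.
        with urllib.request.urlopen(request, timeout=10) as response:
            print(response.status)
```

A call such as `send_messages("messages.json", APIM_URL, "user", "secret")` would post each object in turn. Keeping request construction in its own function makes it possible to check headers and payload without touching the network.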

Required Knowledge and Prerequisites

  • An Azure account
  • Basic programming skills (Python)
  • Basic knowledge of working with data storage
  • Basics of API (recommended course: "Designing and Developing APIs with FastAPI")
  • Basics of working with message queues

Watch Online Data Engineering on Azure

# Title Duration
1 Data Engineering in Azure - Streaming Data Pipelines 02:44
2 Introduction to Datasets and Local Preprocessing 07:07
3 Deploying your Code on Visual Studio to Docker containers 05:28
4 Develop Azure Functions via Python and VS Code 05:53
5 Deploy Azure Function to Azure Function App and Test it 06:27
6 Integrate Azure Function with Blob Storage via bindings 04:59
7 Expose Azure Function as a Backend, and Test it from Insomnia 07:06
8 Securely Store Secrets in Azure Key Vault and Connect APIM to Key Vault 04:42
9 Add Basic authentication in API Management using Key Vault and Named Values 04:36
10 Test APIM and Imported Azure Function App and Function via Local Python Program 02:35
11 Create Event Hubs and Test Capture Events Feature 07:00
12 Modify Existing Azure Function to Include Event Hubs Binding and Test It 06:43
13 Create a Cosmos DB (Core SQL) and Create a New Azure Function that writes Messages to Cosmos DB 09:04
14 Connect Power BI Desktop via Connector, and create a dashboard 06:33
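
Lessons 12 and 13 move messages from Event Hubs into Cosmos DB through an Azure Function. The sketch below shows only the message-shaping step that such a function might perform, written as plain Python so it runs without the Azure SDK. The function name `to_cosmos_document` is hypothetical, and it assumes each message body is a UTF-8 JSON object and that stored items should carry a string `id`.

```python
import json
import uuid


def to_cosmos_document(event_body: bytes) -> dict:
    """Turn one message body (UTF-8 JSON object) into a dict ready to store."""
    message = json.loads(event_body.decode("utf-8"))
    if not isinstance(message, dict):
        raise ValueError("expected one JSON object per message")

    document = dict(message)
    # Assumption: items are identified by a string "id" field.
    # Reuse the message's own id when present, otherwise generate one.
    if message.get("id") is not None:
        document["id"] = str(message["id"])
    else:
        document["id"] = str(uuid.uuid4())
    return document
```

In an actual function, the Event Hubs trigger would supply the message body and an output binding or SDK client would persist the returned dict; those parts are left out here because they depend on the project's binding configuration.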

Similar courses to Data Engineering on Azure

dbt for Data Engineers
Author: Andreas Kretz
Category: Data processing and analysis
Duration: 1 hour 52 minutes 55 seconds

Deep Learning A-Z™: Hands-On Artificial Neural Networks
Author: Udemy
Category: Python, Data processing and analysis
Duration: 22 hours 36 minutes 30 seconds

Statistics for Data Science and Business Analysis
Author: Udemy
Category: Data processing and analysis
Duration: 4 hours 49 minutes 30 seconds

Case Study in Product Data Science
Author: LunarTech
Category: Data processing and analysis
Duration: 1 hour 4 minutes 47 seconds

DS4B 101-P: Python for Data Science Automation
Author: Business Science University
Category: Python, Data processing and analysis
Duration: 27 hours 6 minutes 1 second

Machine Learning & Containers on AWS
Author: Andreas Kretz
Category: Data processing and analysis, Machine learning
Duration: 1 hour 33 minutes 34 seconds

Data Engineering with Hadoop
Author: Suyog Nagaokar
Category: Data processing and analysis
Duration: 7 hours 3 minutes

TensorFlow Developer Certificate in 2023: Zero to Mastery
Author: zerotomastery.io
Category: Data processing and analysis
Duration: 62 hours 43 minutes 54 seconds

AZ-104 Microsoft Azure Administrator
Author: Adrian Cantrill
Category: Azure
Duration: 27 hours 1 minute 23 seconds

MongoDB Fundamentals
Author: Andreas Kretz
Category: MongoDB, Data processing and analysis
Duration: 1 hour 23 minutes 19 seconds