Choosing Data Stores

1h 25m 31s
English
Paid

One of the key tasks in building a data platform and its pipelines is choosing the right data store. This course is dedicated to that choice.

We will examine relational and NoSQL databases, as well as data warehouses and data lakes. You will learn when to use each type of storage and how to properly integrate it into your architecture.

After completing the course, you will understand how to store data and how to choose the appropriate storage for specific tasks. This will help you better navigate different types of storage and make informed decisions in your work as a data engineer. In subsequent courses, we will delve into specific technologies from each category.


Basics of Data Stores

First, you will study the basic principles: the differences between OLTP (online transaction processing) and OLAP (online analytical processing) systems, and the scenarios in which each is used. You will also learn what ETL (Extract, Transform, Load) and ELT (Extract, Load, Transform) are and how these approaches relate to the choice of data store. At the end of the section, I will share a resource where you can further explore the types of data stores and compare them with each other.
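
As a rough illustration of the ETL/ELT distinction mentioned above, the sketch below loads the same records both ways using Python's built-in `sqlite3` module. It is not taken from the course: the `RAW_ORDERS` data, the table names, and the helper functions are all hypothetical, and an in-memory SQLite database stands in for whatever store you would actually use.

```python
import sqlite3

# Hypothetical raw records, standing in for data pulled from a source system.
RAW_ORDERS = [("1", " 19.99 "), ("2", "5.00"), ("3", " 12.50")]


def etl(conn, rows):
    """ETL: transform in application code first, then load the cleaned rows."""
    cleaned = [(int(order_id), float(amount.strip())) for order_id, amount in rows]
    conn.execute("CREATE TABLE orders_etl (order_id INTEGER, amount REAL)")
    conn.executemany("INSERT INTO orders_etl VALUES (?, ?)", cleaned)


def elt(conn, rows):
    """ELT: load the raw rows unchanged, then transform inside the store with SQL."""
    conn.execute("CREATE TABLE orders_raw (order_id TEXT, amount TEXT)")
    conn.executemany("INSERT INTO orders_raw VALUES (?, ?)", rows)
    conn.execute(
        "CREATE TABLE orders_elt AS "
        "SELECT CAST(order_id AS INTEGER) AS order_id, "
        "CAST(TRIM(amount) AS REAL) AS amount "
        "FROM orders_raw"
    )


def total(conn, table):
    """Sum the amount column of one of the tables created above."""
    return conn.execute(f"SELECT SUM(amount) FROM {table}").fetchone()[0]


conn = sqlite3.connect(":memory:")
etl(conn, RAW_ORDERS)
elt(conn, RAW_ORDERS)
etl_total = total(conn, "orders_etl")
elt_total = total(conn, "orders_elt")
raw_row_count = conn.execute("SELECT COUNT(*) FROM orders_raw").fetchone()[0]
```

Both paths end with the same cleaned data. The difference is where the transformation runs and whether the raw records remain available in the store afterwards, which is one reason the choice of method is tied to the choice of data store.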

Relational Databases

We will go through a step-by-step guide to selecting the appropriate data storage that you can use in your work. Then, we will take a closer look at relational databases: you will learn about the principles of CRUD and ACID, as well as get acquainted with examples of specific DBMS.
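
To make the CRUD and ACID terms concrete, here is a minimal sketch using Python's built-in `sqlite3` module. It is an illustration only, not material from the course: the `accounts` table, the `transfer` helper, and the balances are hypothetical. It walks through create, read, update, and delete, and shows atomicity (the "A" in ACID) by rolling back a transfer that would violate a constraint.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts ("
    "id INTEGER PRIMARY KEY, "
    "balance INTEGER NOT NULL CHECK (balance >= 0))"
)

# Create: insert two hypothetical accounts.
conn.executemany("INSERT INTO accounts (id, balance) VALUES (?, ?)", [(1, 100), (2, 50)])
conn.commit()


def balances(conn):
    """Read: return {account_id: balance} for all accounts."""
    return dict(conn.execute("SELECT id, balance FROM accounts ORDER BY id"))


def transfer(conn, src, dst, amount):
    """Update: move money between accounts as a single atomic transaction."""
    try:
        with conn:  # commits if the block succeeds, rolls back if it raises
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE id = ?", (amount, dst)
            )
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE id = ?", (amount, src)
            )
        return True
    except sqlite3.IntegrityError:
        # The CHECK constraint failed, so the credit above was rolled back too.
        return False


ok = transfer(conn, 1, 2, 30)
after_ok = balances(conn)

failed = transfer(conn, 1, 2, 500)  # would drive account 1 below zero
after_failed = balances(conn)

# Delete: remove account 2.
conn.execute("DELETE FROM accounts WHERE id = ?", (2,))
conn.commit()
after_delete = balances(conn)
```

The failed transfer leaves both balances exactly as they were, even though the credit to the destination account had already been applied inside the transaction. That all-or-nothing behavior is what relational databases guarantee with transactions.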

NoSQL Databases

Here you will learn what NoSQL is, what types of NoSQL databases exist (document stores, time series databases, search engines, wide-column stores, key-value stores, and graph databases), their characteristics, and the tasks for which each is suitable. We will also discuss the trade-offs between read and write speed and the importance of setting goals when choosing a data store.

Data Warehouses and Data Lakes

At the end of the course, you will learn what data warehouses and data lakes are, how they differ, and the specific cases in which to use each.

Watch Online Choosing Data Stores

# Title Duration
1 Introduction 02:10
2 OLTP vs OLAP 07:35
3 ETL vs ELT 05:46
4 Data Stores Ranking 04:06
5 How to Choose Data Stores 08:12
6 Relational Databases 06:35
7 NoSQL Basics 10:40
8 Document Stores 05:57
9 Time Series Databases 05:01
10 Search Engines 04:19
11 Wide Column Stores 04:23
12 Key Value Stores 05:00
13 Graph Databases 01:06
14 Data Warehouses 05:33
15 Data Lakes 07:11
16 Conclusion 01:57

Similar courses to Choosing Data Stores

Deep Learning A-Z™: Hands-On Artificial Neural Networks (udemy)
Category: Python, Data processing and analysis
Duration: 22 hours 36 minutes 30 seconds

Statistics Bootcamp (with Python): Zero to Mastery (zerotomastery.io)
Category: Python, ChatGPT, Data processing and analysis
Duration: 20 hours 50 minutes 51 seconds

Spark and Python for Big Data with PySpark (udemy)
Category: Python, Data processing and analysis
Duration: 10 hours 35 minutes 43 seconds

Time Series Analysis, Forecasting, and Machine Learning (udemy)
Category: Python, Data processing and analysis
Duration: 22 hours 47 minutes 45 seconds

The Data Science Course: Complete Data Science Bootcamp 2023 (udemy)
Category: Data processing and analysis
Duration: 31 hours 14 minutes 14 seconds

Complete linear algebra: theory and implementation (udemy)
Category: Python, Data processing and analysis
Duration: 32 hours 53 minutes 26 seconds

Building APIs with FastAPI (Andreas Kretz)
Category: Python, Data processing and analysis
Duration: 1 hour 35 minutes 40 seconds

Apache Spark Certification Training (Florian Roscheck)
Category: Python, Data processing and analysis
Duration: 15 hours 13 minutes 1 second

Data Platform & Pipeline Design (Andreas Kretz)
Category: Data processing and analysis
Duration: 1 hour 59 minutes 5 seconds

Machine Learning Design Questions (algoexpert)
Category: Data processing and analysis
Duration: 3 hours 3 minutes 57 seconds