
Choosing Data Stores

1h 25m 31s
English
Paid

Course description

One of the key tasks in creating a data platform and pipelines is choosing the appropriate data storage. This course is dedicated to this topic.

We will examine relational and NoSQL databases, as well as data warehouses and data lakes. You will learn when to use each type of storage and how to properly integrate it into your architecture.

After completing the course, you will understand how to store data and how to choose the appropriate storage for specific tasks. This will help you better navigate different types of storage and make informed decisions in your work as a data engineer. In subsequent courses, we will delve into specific technologies from each category.


Basics of Data Warehouses

First, you will study the basic principles: the differences between OLTP (online transaction processing) systems and OLAP (online analytical processing) systems, and the scenarios in which each is used. You will also learn what ETL (Extract, Transform, Load) and ELT (Extract, Load, Transform) are and how these approaches relate to the choice of data store. At the end of the section, I will share a resource where you can further explore the types of data stores and compare them with each other.
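To make the ETL and ELT distinction concrete, here is a minimal sketch, not taken from the course itself. It assumes an in-memory SQLite database as a stand-in for the target store; the sample rows, table names, and function names are all hypothetical. ETL cleans the rows in application code before loading them, while ELT loads the raw rows first and transforms them with SQL inside the store. The final query is an OLAP-style aggregate over the loaded data.

```python
import sqlite3

# Hypothetical raw records, as they might arrive from a source system.
RAW_ORDERS = [
    ("1", " alice ", "19.90"),
    ("2", "BOB", "5.00"),
    ("3", "alice", "0.10"),
]


def etl(conn):
    """ETL: clean the rows in application code, then load only the result."""
    conn.execute(
        "CREATE TABLE orders_etl (id INTEGER, customer TEXT, amount_cents INTEGER)"
    )
    cleaned = [
        (int(order_id), customer.strip().lower(), round(float(amount) * 100))
        for order_id, customer, amount in RAW_ORDERS
    ]
    conn.executemany("INSERT INTO orders_etl VALUES (?, ?, ?)", cleaned)


def elt(conn):
    """ELT: load the raw rows untouched, then transform inside the store."""
    conn.execute("CREATE TABLE orders_raw (id TEXT, customer TEXT, amount TEXT)")
    conn.executemany("INSERT INTO orders_raw VALUES (?, ?, ?)", RAW_ORDERS)
    conn.execute(
        """
        CREATE TABLE orders_elt AS
        SELECT CAST(id AS INTEGER) AS id,
               lower(trim(customer)) AS customer,
               CAST(round(CAST(amount AS REAL) * 100) AS INTEGER) AS amount_cents
        FROM orders_raw
        """
    )


def totals(conn, table):
    """OLAP-style aggregate: total spend per customer, in cents."""
    query = (
        f"SELECT customer, SUM(amount_cents) FROM {table} "
        "GROUP BY customer ORDER BY customer"
    )
    return conn.execute(query).fetchall()


conn = sqlite3.connect(":memory:")
etl(conn)
elt(conn)
etl_totals = totals(conn, "orders_etl")
elt_totals = totals(conn, "orders_elt")
```

Both paths end with the same cleaned table. The practical difference is where the transformation runs and whether the raw data stays available in the store afterwards, which is one reason the choice is tied to the kind of store you pick.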

Relational Databases

We will go through a step-by-step guide to selecting the appropriate data store that you can use in your work. Then, we will take a closer look at relational databases: you will learn about the principles of CRUD (create, read, update, delete) and ACID (atomicity, consistency, isolation, durability), and get acquainted with examples of specific database management systems (DBMS).
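As a rough illustration of CRUD and of the atomicity part of ACID, the sketch below uses Python's built-in `sqlite3` module. The `accounts` table, the sample balances, and the `transfer` helper are hypothetical and chosen only for this example. A transfer that would overdraw an account violates a CHECK constraint, so the whole transaction rolls back, including the credit that had already been applied.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts ("
    "name TEXT PRIMARY KEY, "
    "balance INTEGER CHECK (balance >= 0))"
)

# CRUD: create rows, update one, delete one, then read the rest back later.
with conn:
    conn.execute("INSERT INTO accounts VALUES ('alice', 100)")
    conn.execute("INSERT INTO accounts VALUES ('bob', 50)")
    conn.execute("INSERT INTO accounts VALUES ('temp', 0)")
    conn.execute("UPDATE accounts SET balance = balance + 10 WHERE name = 'bob'")
    conn.execute("DELETE FROM accounts WHERE name = 'temp'")


def transfer(conn, source, target, amount):
    """Move money in one transaction: both updates commit or neither does."""
    try:
        with conn:  # commits on success, rolls back if an exception escapes
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, target),
            )
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, source),
            )
        return True
    except sqlite3.IntegrityError:
        # The CHECK constraint rejected a negative balance.
        return False


ok = transfer(conn, "alice", "bob", 30)
rejected = transfer(conn, "alice", "bob", 500)
balances = dict(conn.execute("SELECT name, balance FROM accounts ORDER BY name"))
```

After the rejected transfer, the balances are the same as before it started, which is the behavior atomicity promises.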

NoSQL Databases

Here you will learn what NoSQL is, what types of such databases exist (document stores, time series databases, search engines, wide column stores, key-value stores, and graph databases), their characteristics, and the tasks for which they are suitable. We will also discuss the trade-offs between read and write speed and the importance of setting goals when choosing a data store.
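The read versus write trade-off can be sketched with two toy key-value stores. This is an illustration under simplifying assumptions, not how any particular database is implemented; the class names `AppendLog` and `SortedIndex` are hypothetical. One store makes writes cheap and reads expensive, the other does the opposite.

```python
import bisect


class AppendLog:
    """Write-optimized: appends are O(1), lookups scan the whole log."""

    def __init__(self):
        self.entries = []

    def put(self, key, value):
        self.entries.append((key, value))

    def get(self, key):
        # Scan from newest to oldest so the latest write wins.
        for k, v in reversed(self.entries):
            if k == key:
                return v
        return None


class SortedIndex:
    """Read-optimized: writes keep keys sorted, lookups use binary search."""

    def __init__(self):
        self.keys = []
        self.values = []

    def put(self, key, value):
        i = bisect.bisect_left(self.keys, key)
        if i < len(self.keys) and self.keys[i] == key:
            self.values[i] = value  # overwrite an existing key in place
        else:
            self.keys.insert(i, key)  # O(n) shift to keep the order
            self.values.insert(i, value)

    def get(self, key):
        i = bisect.bisect_left(self.keys, key)
        if i < len(self.keys) and self.keys[i] == key:
            return self.values[i]
        return None


log, index = AppendLog(), SortedIndex()
for key, value in [("b", 1), ("a", 2), ("b", 3)]:
    log.put(key, value)
    index.put(key, value)
```

Both stores return the same answers; they differ only in which operation pays the cost. Deciding up front whether a workload is read-heavy or write-heavy is what makes that choice possible.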

Data Warehouses and Data Lakes

At the end of the course, you will learn what data warehouses and data lakes are, how they differ, and the specific cases in which to use each.
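One commonly cited difference between the two is when structure is enforced: warehouses typically validate data against a schema when it is written, while lakes typically store raw data and apply structure when it is read. The sketch below illustrates that idea only; the schema, field names, and helper functions are assumptions made for this example, and plain Python lists stand in for the actual storage.

```python
import json

# Hypothetical schema for an events table.
SCHEMA = {"user_id": int, "event": str}


def load_into_warehouse(table, record):
    """Schema-on-write: validate before storing and reject malformed rows."""
    if set(record) != set(SCHEMA):
        raise ValueError(f"unexpected fields: {sorted(record)}")
    for field, expected in SCHEMA.items():
        if not isinstance(record[field], expected):
            raise ValueError(f"{field} must be {expected.__name__}")
    table.append(record)


def load_into_lake(files, raw_line):
    """Schema-on-read: store the raw line as-is, whatever its shape."""
    files.append(raw_line)


def read_from_lake(files):
    """Apply structure only at read time and skip lines that do not fit."""
    for line in files:
        record = json.loads(line)
        if isinstance(record.get("user_id"), int) and isinstance(
            record.get("event"), str
        ):
            yield {"user_id": record["user_id"], "event": record["event"]}


raw_lines = [
    '{"user_id": 1, "event": "login"}',
    '{"user_id": "x", "event": "click", "extra": true}',
]
warehouse, lake = [], []
rejected = 0
for line in raw_lines:
    load_into_lake(lake, line)
    try:
        load_into_warehouse(warehouse, json.loads(line))
    except ValueError:
        rejected += 1

lake_view = list(read_from_lake(lake))
```

The lake keeps both lines, including the malformed one, so a later reader can interpret it differently; the warehouse holds only rows that already match the schema.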


All Course Lessons (16)

#    Lesson Title                 Duration   Access
1    Introduction                 02:10      Demo
2    OLTP vs OLAP                 07:35
3    ETL vs ELT                   05:46
4    Data Stores Ranking          04:06
5    How to Choose Data Stores    08:12
6    Relational Databases         06:35
7    NoSQL Basics                 10:40
8    Document Stores              05:57
9    Time Series Databases        05:01
10   Search Engines               04:19
11   Wide Column Stores           04:23
12   Key Value Stores             05:00
13   Graph Databases              01:06
14   Data Warehouses              05:33
15   Data Lakes                   07:11
16   Conclusion                   01:57


Similar courses

  • Becoming a Better Data Engineer
    Sources: Andreas Kretz
    Data engineering is not just about moving information from one place to another. It is about creating reliable, scalable, and efficient systems that...
    1 hour 46 minutes 10 seconds
  • Azure Data Pipelines with Terraform
    Sources: Andreas Kretz
    Azure is becoming an increasingly popular platform for companies using the Microsoft365 ecosystem. If you want to enhance your data engineering skills...
    4 hours 20 minutes 29 seconds
  • Machine Learning & Containers on AWS
    Sources: Andreas Kretz
    In this practical course, you will learn how to build a complete data pipeline on the AWS platform - from obtaining data from the Twitter API to analysis, stora
    1 hour 33 minutes 34 seconds
  • The Data Science Course: Complete Data Science Bootcamp 2023
    Sources: udemy
    Data scientist is one of the best suited professions to thrive this century. It is digital, programming-oriented, and analytical. Therefore, it comes as no surp
    31 hours 14 minutes 14 seconds
  • Build Fast Masterclass
    Sources: BuildFast Academy
    How to finally Launch your AI Product (without ripping your hair out).. that makes you money in 30 days (or less). But unlike other AI courses, you won't learn
    7 hours 22 minutes 11 seconds