Skip to main content

Choosing Data Stores

1h 25m 31s
English
Paid

Choosing the right data storage is a fundamental task in creating a data platform and building pipelines. This course is dedicated to guiding you through this important topic.

Overview of Data Storage Types

Throughout this course, we will explore different types of data storage solutions including relational and NoSQL databases, as well as data warehouses and data lakes. You will learn the appropriate scenarios for using each type of storage and how to effectively integrate them into your architecture.

Completing this course will equip you with the necessary knowledge to understand various data storage solutions and help you make informed decisions, enhancing your capabilities as a data engineer. Future courses will dive deeper into specific technologies within each category.

Basics of Data Warehousing

We begin by discussing foundational principles: the key differences between OLTP (Operational Transactional Systems) and OLAP (Analytical Systems), and their respective use cases. You will also learn about ETL (Extract, Transform, Load) and ELT (Extract, Load, Transform), and how these processes influence the selection of data warehouses. At the end of this section, additional resources will be provided for further exploration and comparison of different data warehouse types.

Understanding Relational Databases

This section offers a step-by-step guide to selecting a suitable data storage solution for your projects. You will gain an in-depth understanding of relational databases, including CRUD operations and the ACID principles, along with examples from specific Database Management Systems (DBMS).

Exploring NoSQL Databases

In this part of the course, you will discover what NoSQL databases are, including various types such as document-based, columnar, temporal, and search databases. We will examine their unique characteristics and suitable applications. Additionally, we will discuss the trade-offs between read and write speeds and emphasize the importance of setting goals when selecting storage solutions.

Data Warehouses vs. Data Lakes

The course concludes with a detailed comparison of data warehouses and data lakes. You will learn about the differences between these storage solutions and the specific use cases where each excels. This knowledge will further aid in your decision-making process when architecting data platforms.

About the Author: Andreas Kretz

Andreas Kretz thumbnail

I am a senior data engineer and trainer, a tech enthusiast, and a father. For more than ten years, I have been passionate about Data Engineering. Initially, I became a self-taught data engineer and then led a team of data engineers at a large company. When I realized the great demand for education in this field, I followed my passion and founded my own Data Engineering Academy. Since then, I have helped over 2,000 students achieve their goals.

Watch Online 16 lessons

This is a demo lesson (10:00 remaining)

You can watch up to 10 minutes for free. Subscribe to unlock all 16 lessons in this course and access 10,000+ hours of premium content across all courses.

View Pricing
0:00
/
#1: Introduction
All Course Lessons (16)
#Lesson TitleDurationAccess
1
Introduction Demo
02:10
2
OLTP vs OLAP
07:35
3
ETL vs ELT
05:46
4
Data Stores Ranking
04:06
5
How to Choose Data Stores
08:12
6
Relational Databases
06:35
7
NoSQL Basics
10:40
8
Document Stores
05:57
9
Time Series Databases
05:01
10
Search Engines
04:19
11
Wide Column Stores
04:23
12
Key Value Stores
05:00
13
Graph Databases
01:06
14
Data Warehouses
05:33
15
Data Lakes
07:11
16
Conclusion
01:57
Unlock unlimited learning

Get instant access to all 15 lessons in this course, plus thousands of other premium courses. One subscription, unlimited knowledge.

Learn more about subscription