Skip to main content

The Hidden Foundation of GenAI

20m 42s
English
Paid

Generative AI is everywhere today, but few understand the fundamental concepts it's built upon. "The Hidden Foundation of GenAI" is a starting point for those who want to truly understand what lies behind LLM, vector search, and semantic understanding. This course is specifically designed for data engineers and focuses on embeddings—one of the most important (and most misinterpreted) building blocks of any GenAI system.

Instead of overloading with mathematical theory, we provide practical insights: how text is converted into vectors, how similarity is calculated, and how this underlies scenarios like semantic search and Retrieval-Augmented Generation (RAG). You will work with an interactive Embedding Playground, analyze Python examples, and gain the confidence to use vector search in your own projects.

This course opens a series of sessions on GenAI at the Academy. In the following modules, you will continue exploring semantic search, vector databases, and complete your journey with a full-fledged project—implementing a GenAI pipeline with RAG.

What awaits you in the course:

  1. Clear and practical introduction to embeddings without excessive terminology.
  2. Working with Embedding Playground and understanding the mechanics of text similarity.
  3. Step-by-step breakdown of converting text into vectors and the role of embedding models.
  4. Practice in Python: cosine similarity, the difference between structural and semantic similarity.
  5. Real-world aspects: tokens, the cost of LLM API requests, and the impact of this on production workloads.

About the Author: Andreas Kretz

Andreas Kretz thumbnail

I am a senior data engineer and trainer, a tech enthusiast, and a father. For more than ten years, I have been passionate about Data Engineering. Initially, I became a self-taught data engineer and then led a team of data engineers at a large company. When I realized the great demand for education in this field, I followed my passion and founded my own Data Engineering Academy. Since then, I have helped over 2,000 students achieve their goals.

Watch Online 9 lessons

This is a demo lesson (10:00 remaining)

You can watch up to 10 minutes for free. Subscribe to unlock all 9 lessons in this course and access 10,000+ hours of premium content across all courses.

View Pricing
0:00
/
#1: Intro to the GenAI Track: Practical Foundations for Data Engineers
All Course Lessons (9)
#Lesson TitleDurationAccess
1
Intro to the GenAI Track: Practical Foundations for Data Engineers Demo
00:25
2
Embeddings in Action: Playground, Search, and RAG
01:47
3
Hands-On with Embeddings: Comparing Text Similarity
02:28
4
Understanding Similarity: From Angles to Embedding Scores
02:14
5
Text Structure vs. Meaning: Understanding Embedding Scores
02:23
6
Why Your Embedding Model Matters (A Lot)
02:34
7
Understanding Tokens: From Text to Vectors to Cost
03:43
8
Embedding Walkthrough: Real Data in Semantic Search and RAG Pipelines
04:20
9
That’s It, you Know Enough to Build
00:48
Unlock unlimited learning

Get instant access to all 8 lessons in this course, plus thousands of other premium courses. One subscription, unlimited knowledge.

Learn more about subscription