Skip to main content

The Hidden Foundation of GenAI

20m 42s
English
Paid

Course description

Generative AI is everywhere today, but few understand the fundamental concepts it's built upon. "The Hidden Foundation of GenAI" is a starting point for those who want to truly understand what lies behind LLM, vector search, and semantic understanding. This course is specifically designed for data engineers and focuses on embeddings—one of the most important (and most misinterpreted) building blocks of any GenAI system.

Instead of overloading with mathematical theory, we provide practical insights: how text is converted into vectors, how similarity is calculated, and how this underlies scenarios like semantic search and Retrieval-Augmented Generation (RAG). You will work with an interactive Embedding Playground, analyze Python examples, and gain the confidence to use vector search in your own projects.

This course opens a series of sessions on GenAI at the Academy. In the following modules, you will continue exploring semantic search, vector databases, and complete your journey with a full-fledged project—implementing a GenAI pipeline with RAG.

Read more about the course

What awaits you in the course:

  1. Clear and practical introduction to embeddings without excessive terminology.
  2. Working with Embedding Playground and understanding the mechanics of text similarity.
  3. Step-by-step breakdown of converting text into vectors and the role of embedding models.
  4. Practice in Python: cosine similarity, the difference between structural and semantic similarity.
  5. Real-world aspects: tokens, the cost of LLM API requests, and the impact of this on production workloads.

Watch Online

This is a demo lesson (10:00 remaining)

You can watch up to 10 minutes for free. Subscribe to unlock all 9 lessons in this course and access 10,000+ hours of premium content across all courses.

View Pricing
0:00
/
#1: Intro to the GenAI Track: Practical Foundations for Data Engineers

All Course Lessons (9)

#Lesson TitleDurationAccess
1
Intro to the GenAI Track: Practical Foundations for Data Engineers Demo
00:25
2
Embeddings in Action: Playground, Search, and RAG
01:47
3
Hands-On with Embeddings: Comparing Text Similarity
02:28
4
Understanding Similarity: From Angles to Embedding Scores
02:14
5
Text Structure vs. Meaning: Understanding Embedding Scores
02:23
6
Why Your Embedding Model Matters (A Lot)
02:34
7
Understanding Tokens: From Text to Vectors to Cost
03:43
8
Embedding Walkthrough: Real Data in Semantic Search and RAG Pipelines
04:20
9
That’s It, you Know Enough to Build
00:48

Unlock unlimited learning

Get instant access to all 8 lessons in this course, plus thousands of other premium courses. One subscription, unlimited knowledge.

Learn more about subscription

Comments

0 comments

Want to join the conversation?

Sign in to comment

Similar courses

The Software Designer Mindset (COMPLETE)

The Software Designer Mindset (COMPLETE)

Sources: ArjanCodes
"The Software Designer Mindset" is a course that teaches all aspects of software architecture and offers practical advice on creating scalable software...
14 hours 32 minutes 58 seconds
Model Context Protocol (MCP) 101

Model Context Protocol (MCP) 101

Sources: Mckay Wrigley (takeoff)
In this course, you will learn what Model Context Protocol (MCP) is, why it is important, and how to apply it in practice. We will cover the main principles...
2 hours 10 minutes 15 seconds
Python and Django Full Stack Web Developer Bootcamp

Python and Django Full Stack Web Developer Bootcamp

Sources: udemy
Welcome to the Python and Django Full Stack Web Developer Bootcamp! In this course we cover everything you need to know to build a website using Python, Django,
31 hours 54 minutes 39 seconds
Beginner Python Primer for AI Engineering

Beginner Python Primer for AI Engineering

Sources: Towards AI, Louis-François Bouchard
Don't just interact with LLM models - create your own AI solutions in Python. This course will take you from beginner to confident proficiency in Python...
1 hour 41 minutes 58 seconds