Skip to main content

Semantic Log Indexing & Search

53m 37s
English
Paid

Course description

Semantic search is one of the most practical applications of generative AI in real data processing projects. In this course, we go beyond the basic introduction to embeddings (from the course The Hidden Foundation of GenAI) and start using them in practice. You will learn to build a complete semantic search pipeline from scratch: from creating embeddings and storing them in a vector database to performing natural language queries.

The course is built around a real data observability project. You will create a pipeline that collects logs, processes them using FastAPI, and stores the embeddings in qdrant - a high-performance vector storage. Then, you will develop a dashboard on Streamlit, allowing you to search logs by meaning, rather than by keywords, and compare the results with traditional SQL queries in DuckDB.

Read more about the course

Key steps of the course:

  1. From embeddings to search: review the basics of embeddings and analyze how exactly they enable semantic search functionality.
  2. Building a pipeline: implementing an API on FastAPI for log processing and embedding generation.
  3. Working with qdrant: collections, points, cosine similarity search, and optimization of embedding structure.
  4. Streamlit interface: creating a user-friendly search and comparing the semantic approach with classic SQL.
  5. Improving accuracy: methods for optimizing embeddings, query formulation, and search configuration.
  6. Launching in Docker: deploying the entire stack (FastAPI, qdrant, Streamlit, DuckDB) using Docker Compose.
  7. Bonus: using DuckDB for analytics - implementing WAL, working with data in Docker, and comparing the capabilities of SQL and vector search.


Upon completion of the course, you will not only understand the mechanics of semantic search but also have a ready-to-use working project that can be adapted for your own AI-based solutions.

Watch Online

This is a demo lesson (10:00 remaining)

You can watch up to 10 minutes for free. Subscribe to unlock all 16 lessons in this course and access 10,000+ hours of premium content across all courses.

View Pricing

Watch Online Semantic Log Indexing & Search

0:00
/
#1: Intro

All Course Lessons (16)

#Lesson TitleDurationAccess
1
Intro Demo
00:44
2
Getting Started: Semantic Search for Your Logs
03:08
3
Dissecting the Pipeline Monitor Architecture: FastAPI, Qdrant & DuckDB
03:50
4
Beginner’s Guide to Qdrant Collections and Similarity Search
03:28
5
Your First Glimpse at the Project Code Structure on GitHub
02:55
6
Building and Launching the Pipeline with Docker Compose
04:37
7
Writing JSON Logs to FastAPI: Bulk Upload Explained
01:42
8
How FastAPI Parses LogEntry Models and Prepares Embeddings
04:37
9
Embeddings 101: Turning Your Logs into Searchable Vectors
02:06
10
Querying Qdrant: From Playground to Streamlit Dashboard
03:55
11
Hands-On Embedding Tuning: Boost Your Log Search Accuracy
03:54
12
Deploying Improved Embeddings and Measuring Improvement
05:35
13
What We Built and Why It Matters
02:53
14
How DuckDB Fits into Your Data Observability Stack
01:28
15
Writing to DuckDB with a Write-Ahead Log
05:03
16
Docker & DuckDB: Implementing WAL to Solve File Lock Errors
03:42

Unlock unlimited learning

Get instant access to all 15 lessons in this course, plus thousands of other premium courses. One subscription, unlimited knowledge.

Learn more about subscription

Comments

0 comments

Want to join the conversation?

Sign in to comment

Similar courses

Data Structures and Algorithmic Trading: Machine Learning

Data Structures and Algorithmic Trading: Machine Learning

Sources: udemy
Data Structures and Algorithmic trading is a method of executing orders using automated pre-programmed trading instructions over time. They were developed so th
2 hours 20 minutes 32 seconds
v0 Crash Course

v0 Crash Course

Sources: Mckay Wrigley (takeoff)
Quickly master the basics of working with v0 from Vercel - a tool for generating user interfaces based on text. In this course, you will learn how to...
47 minutes 41 seconds
Build Fast Masterclass

Build Fast Masterclass

Sources: BuildFast Academy
How to finally Launch your AI Product (without ripping your hair out).. that makes you money in 30 days (or less). But unlike other AI courses, you won't learn
7 hours 22 minutes 11 seconds
AI Engineering: Customizing LLMs for Business (Fine-Tuning LLMs with QLoRA & AWS)

AI Engineering: Customizing LLMs for Business (Fine-Tuning LLMs with QLoRA & AWS)

Sources: zerotomastery.io
Master an in-demand skill that companies are looking for: the development and implementation of custom LLMs. In the course, you will learn how to fine-tune open
7 hours 12 minutes 10 seconds
RAG: Beyond Basics

RAG: Beyond Basics

Sources: Prompt Engineering
The course is dedicated to the practical and theoretical study of Retrieval-Augmented Generation (RAG). You will learn not only "how" but also "why" these...
2 hours 40 minutes 48 seconds