AI Engineering Course
1h 36m 46s
English
Paid
Course description
This course is designed to help programmers and developers transition into the field of artificial intelligence engineering. You will thoroughly explore vector databases, indexing, large language models (LLM), and the attention mechanism.
Read more about the course
By the end of the course, you will understand how LLMs work and be able to use them to create real applications.
What you will learn:
- Develop mental models of how LLMs in the style of GPT work
- Understand processes such as tokenization, embeddings, attention, and masking
- Optimize LLM inference using caching, batching, and quantization
- Design and deploy RAG pipelines using vector databases
- Compare methods: prompt engineering, fine-tuning, and agent-based architectures
- Debug, monitor, and scale LLM systems in production
Watch Online
0:00
/ #1: Course Intro
All Course Lessons (21)
| # | Lesson Title | Duration | Access |
|---|---|---|---|
| 1 | Course Intro Demo | 02:01 | |
| 2 | Usecase | 01:48 | |
| 3 | How are vectors constructed | 06:43 | |
| 4 | Choosing the right DB | 03:27 | |
| 5 | Vector compression | 03:27 | |
| 6 | Vector Search | 06:59 | |
| 7 | Milvus DB | 05:38 | |
| 8 | LLM Intro | 00:43 | |
| 9 | How LLMs work | 08:31 | |
| 10 | LLM text generation | 03:08 | |
| 11 | LLM improvements | 05:10 | |
| 12 | Attention | 05:28 | |
| 13 | Transformer Architecture | 03:40 | |
| 14 | KV Cache | 08:28 | |
| 15 | Paged Attention | 04:38 | |
| 16 | Mixture Of Experts | 04:01 | |
| 17 | Flash Attention | 03:40 | |
| 18 | Quantization | 03:33 | |
| 19 | Sparse Attention | 05:14 | |
| 20 | SLM and Distillation | 05:31 | |
| 21 | Speculative Decoding | 04:58 |
Unlock unlimited learning
Get instant access to all 20 lessons in this course, plus thousands of other premium courses. One subscription, unlimited knowledge.
Learn more about subscriptionBooks
Read Book AI Engineering Course
| # | Title |
|---|---|
| 1 | 1. Vector+Embeddings+&+Semantic+Space |
| 2 | 2. Compression+&+Quantization_+Scaling+Vectors+Efficiently-4 |
| 3 | 3. Indexing+Techniques_+Making+Vector+Search+Scale |
| 4 | 4. Search+Execution+Flow_+From+Query+to+Result |
| 5 | 5. LLMs+and+RAG |
| 6 | 6. What+is+Attention+and+Why+Does+It+Matter |
| 7 | 7. Paged+Attention |
| 8 | 8. Quantization+Summary |
Comments
0 commentsWant to join the conversation?
Sign in to commentSimilar courses
Build Your SaaS AI Web Platform From Zero to Production
Sources: Code4Startup (coderealprojects)
Discover the new trend in the world of startups and indie hackers - the creation of microservice AI-SaaS products that do more than just meet needs...
8 hours 36 minutes 2 seconds
Build a Reasoning Model (From Scratch)
Sources: Sebastian Raschka
Understand how LLMs reason by creating your own reasoning model from scratch. In the book "Building a Reasoning Model from Scratch," you will step by step...
5 Levels of Agents - Coding Agents
Sources: Mckay Wrigley (takeoff)
This course teaches the creation of intelligent coding agents by going through five levels of complexity. You will learn to develop agents for review and...
5 hours 4 minutes 36 seconds
Build and Deploy a B2B SaaS AI Support Platform
Sources: Code With Antonio
In this course, we will build a customer support platform powered by AI from scratch: we will set up a live chat using Convex Agents, add voice support through.
22 hours 20 minutes 55 seconds
Build AI Agents with n8n
Sources: zerotomastery.io
Learn how to create AI agents in n8n without coding. Discover how to integrate language models, configure triggers, and set up nodes for task automation.
2 hours 51 minutes 16 seconds