AI Engineering Course
1h 36m 46s
English
Paid
Course description
This course is designed to help programmers and developers transition into the field of artificial intelligence engineering. You will thoroughly explore vector databases, indexing, large language models (LLM), and the attention mechanism.
Read more about the course
By the end of the course, you will understand how LLMs work and be able to use them to create real applications.
What you will learn:
- Develop mental models of how LLMs in the style of GPT work
- Understand processes such as tokenization, embeddings, attention, and masking
- Optimize LLM inference using caching, batching, and quantization
- Design and deploy RAG pipelines using vector databases
- Compare methods: prompt engineering, fine-tuning, and agent-based architectures
- Debug, monitor, and scale LLM systems in production
Watch Online
0:00
/ #1: Course Intro
All Course Lessons (21)
| # | Lesson Title | Duration | Access |
|---|---|---|---|
| 1 | Course Intro Demo | 02:01 | |
| 2 | Usecase | 01:48 | |
| 3 | How are vectors constructed | 06:43 | |
| 4 | Choosing the right DB | 03:27 | |
| 5 | Vector compression | 03:27 | |
| 6 | Vector Search | 06:59 | |
| 7 | Milvus DB | 05:38 | |
| 8 | LLM Intro | 00:43 | |
| 9 | How LLMs work | 08:31 | |
| 10 | LLM text generation | 03:08 | |
| 11 | LLM improvements | 05:10 | |
| 12 | Attention | 05:28 | |
| 13 | Transformer Architecture | 03:40 | |
| 14 | KV Cache | 08:28 | |
| 15 | Paged Attention | 04:38 | |
| 16 | Mixture Of Experts | 04:01 | |
| 17 | Flash Attention | 03:40 | |
| 18 | Quantization | 03:33 | |
| 19 | Sparse Attention | 05:14 | |
| 20 | SLM and Distillation | 05:31 | |
| 21 | Speculative Decoding | 04:58 |
Unlock unlimited learning
Get instant access to all 20 lessons in this course, plus thousands of other premium courses. One subscription, unlimited knowledge.
Learn more about subscriptionBooks
Read Book AI Engineering Course
| # | Title |
|---|---|
| 1 | 1. Vector+Embeddings+&+Semantic+Space |
| 2 | 2. Compression+&+Quantization_+Scaling+Vectors+Efficiently-4 |
| 3 | 3. Indexing+Techniques_+Making+Vector+Search+Scale |
| 4 | 4. Search+Execution+Flow_+From+Query+to+Result |
| 5 | 5. LLMs+and+RAG |
| 6 | 6. What+is+Attention+and+Why+Does+It+Matter |
| 7 | 7. Paged+Attention |
| 8 | 8. Quantization+Summary |
Comments
0 commentsWant to join the conversation?
Sign in to commentSimilar courses
AI Engineering: Fine-Tuning LLMs
Sources: zerotomastery.io
If you're interested in an AI that actually works, not just sounds impressive, this compact course is just for you. Fine-tuning the GPT model is not just...
1 hour 35 minutes 46 seconds
Build a React Native app with Claude AI
Sources: designcode.io
This comprehensive course is dedicated to integrating advanced AI tools into the workflow of development in React Native, which allows for a radical change in a
13 hours 53 minutes 10 seconds
Learn how to use MCP (Model Context Protocol)
Sources: Kevin Kern (instructa.ai)
The course is dedicated to mastering the Model Context Protocol (MCP) - an open standard developed by Anthropic for connecting large language models (LLM)...
3 hours 10 minutes 2 seconds
The Complete AI Fast Track Bootcamp - 2024
Sources: Code4Startup (coderealprojects)
The Complete AI Fast Track Bootcamp - 2024 is an intensive online course designed for the rapid acquisition of key skills in the field of artificial intelligenc
10 hours 59 minutes 1 second