AI Engineering Course
By the end of the course, you will understand how LLMs work and be able to use them to create real applications.
What you will learn:
- Develop mental models of how LLMs in the style of GPT work
- Understand processes such as tokenization, embeddings, attention, and masking
- Optimize LLM inference using caching, batching, and quantization
- Design and deploy RAG pipelines using vector databases
- Compare methods: prompt engineering, fine-tuning, and agent-based architectures
- Debug, monitor, and scale LLM systems in production
About the Author: get.interviewready.io
get.interviewready.io is the paid course platform of Gaurav Sen, a software engineer (formerly at Uber) and one of the most widely watched system-design-interview educators on YouTube. His teaching style focuses on building the mental model from first principles — load balancers, sharding, queues, the trade-offs of CAP — rather than memorising specific architectures.
The CourseFlix listing carries his System Design Course and AI Engineering Course. Material is paid and aimed at engineers preparing for senior-level technical interviews at large tech companies, plus a newer track on building AI / LLM-powered systems.
Watch Online 21 lessons
| # | Lesson Title | Duration | Access |
|---|---|---|---|
| 1 | Course Intro Demo | 02:01 | |
| 2 | Usecase | 01:48 | |
| 3 | How are vectors constructed | 06:43 | |
| 4 | Choosing the right DB | 03:27 | |
| 5 | Vector compression | 03:27 | |
| 6 | Vector Search | 06:59 | |
| 7 | Milvus DB | 05:38 | |
| 8 | LLM Intro | 00:43 | |
| 9 | How LLMs work | 08:31 | |
| 10 | LLM text generation | 03:08 | |
| 11 | LLM improvements | 05:10 | |
| 12 | Attention | 05:28 | |
| 13 | Transformer Architecture | 03:40 | |
| 14 | KV Cache | 08:28 | |
| 15 | Paged Attention | 04:38 | |
| 16 | Mixture Of Experts | 04:01 | |
| 17 | Flash Attention | 03:40 | |
| 18 | Quantization | 03:33 | |
| 19 | Sparse Attention | 05:14 | |
| 20 | SLM and Distillation | 05:31 | |
| 21 | Speculative Decoding | 04:58 |
Get instant access to all 20 lessons in this course, plus thousands of other premium courses. One subscription, unlimited knowledge.
Learn more about subscriptionBooks
Read Book AI Engineering Course
Related courses
-
Updated 7mo agoOvernight Fullstack Applications
By: Newline (ex-Fullstack.io)If you are a freelancer or indie hacker for whom speed of implementation is just as important as quality, this course could be the most exciting one this year.28 minutes 5 seconds 5 / 5 -
Updated 1mo agoAI Systems Performance Engineering
By: Chris FreglyExplore the strategy for optimizing AI systems with a focus on hardware and software. Methods for scaling and cost savings for training and inference. -
Updated 2y agoBuilding AI Apps with the Gemini API
By: Zero To MasteryLearn to use Google's Gemini API for building AI-powered applications. Plus you'll put your skills into action by building three projects using the Gemini API.3 hours 43 minutes 41 seconds 5 / 5