Skip to main content

AI Engineering Course

1h 36m 46s
English
Paid

Course description

This course is designed to help programmers and developers transition into the field of artificial intelligence engineering. You will thoroughly explore vector databases, indexing, large language models (LLM), and the attention mechanism.
Read more about the course

By the end of the course, you will understand how LLMs work and be able to use them to create real applications.

What you will learn:

  • Develop mental models of how LLMs in the style of GPT work
  • Understand processes such as tokenization, embeddings, attention, and masking
  • Optimize LLM inference using caching, batching, and quantization
  • Design and deploy RAG pipelines using vector databases
  • Compare methods: prompt engineering, fine-tuning, and agent-based architectures
  • Debug, monitor, and scale LLM systems in production

Watch Online

This is a demo lesson (10:00 remaining)

You can watch up to 10 minutes for free. Subscribe to unlock all 21 lessons in this course and access 10,000+ hours of premium content across all courses.

View Pricing
0:00
/
#1: Course Intro

All Course Lessons (21)

#Lesson TitleDurationAccess
1
Course Intro Demo
02:01
2
Usecase
01:48
3
How are vectors constructed
06:43
4
Choosing the right DB
03:27
5
Vector compression
03:27
6
Vector Search
06:59
7
Milvus DB
05:38
8
LLM Intro
00:43
9
How LLMs work
08:31
10
LLM text generation
03:08
11
LLM improvements
05:10
12
Attention
05:28
13
Transformer Architecture
03:40
14
KV Cache
08:28
15
Paged Attention
04:38
16
Mixture Of Experts
04:01
17
Flash Attention
03:40
18
Quantization
03:33
19
Sparse Attention
05:14
20
SLM and Distillation
05:31
21
Speculative Decoding
04:58

Unlock unlimited learning

Get instant access to all 20 lessons in this course, plus thousands of other premium courses. One subscription, unlimited knowledge.

Learn more about subscription

Books

Read Book AI Engineering Course

#Title
11. Vector+Embeddings+&+Semantic+Space
22. Compression+&+Quantization_+Scaling+Vectors+Efficiently-4
33. Indexing+Techniques_+Making+Vector+Search+Scale
44. Search+Execution+Flow_+From+Query+to+Result
55. LLMs+and+RAG
66. What+is+Attention+and+Why+Does+It+Matter
77. Paged+Attention
88. Quantization+Summary

Comments

0 comments

Want to join the conversation?

Sign in to comment

Similar courses

AI SDK v5 Crash Course

AI SDK v5 Crash Course

Sources: Matt Pocock
New course on AI SDK v5: modern patterns and tools for developing LLM applications in TypeScript. Master integration and create projects ready for...
2 hours 45 minutes 17 seconds
Build a Simple Neural Network & Learn Backpropagation

Build a Simple Neural Network & Learn Backpropagation

Sources: zerotomastery.io
Learn backpropagation and gradient descent by writing a simple neural network from scratch in Python - without libraries, just the basics. Ideal...
4 hours 34 minutes 9 seconds
Master the Model Context Protocol (MCP)

Master the Model Context Protocol (MCP)

Sources: Kent C. Dodds
The most interesting thing in software right now is MCP. It's a protocol that turns applications into smart conversational partners: instead of clicking...
7 hours 23 minutes 25 seconds
Create your Dream Apps with Cursor and Claude AI

Create your Dream Apps with Cursor and Claude AI

Sources: designcode.io
In this course, you will learn to create dream web applications from scratch using Cursor, Claude AI, and a set of powerful AI tools. You will master...
5 hours 46 minutes 6 seconds
Advanced Context Engineering with Claude Code

Advanced Context Engineering with Claude Code

Sources: egghead
Learn context management for AI in the Claude Code course. Discover tools, frameworks, and techniques for predictable artificial intelligence outcomes.
12 minutes 46 seconds