Skip to main content

AI Engineering Course

1h 36m 46s
English
Paid

Course description

This course is designed to help programmers and developers transition into the field of artificial intelligence engineering. You will thoroughly explore vector databases, indexing, large language models (LLM), and the attention mechanism.
Read more about the course

By the end of the course, you will understand how LLMs work and be able to use them to create real applications.

What you will learn:

  • Develop mental models of how LLMs in the style of GPT work
  • Understand processes such as tokenization, embeddings, attention, and masking
  • Optimize LLM inference using caching, batching, and quantization
  • Design and deploy RAG pipelines using vector databases
  • Compare methods: prompt engineering, fine-tuning, and agent-based architectures
  • Debug, monitor, and scale LLM systems in production

Watch Online

This is a demo lesson (10:00 remaining)

You can watch up to 10 minutes for free. Subscribe to unlock all 21 lessons in this course and access 10,000+ hours of premium content across all courses.

View Pricing
0:00
/
#1: Course Intro

All Course Lessons (21)

#Lesson TitleDurationAccess
1
Course Intro Demo
02:01
2
Usecase
01:48
3
How are vectors constructed
06:43
4
Choosing the right DB
03:27
5
Vector compression
03:27
6
Vector Search
06:59
7
Milvus DB
05:38
8
LLM Intro
00:43
9
How LLMs work
08:31
10
LLM text generation
03:08
11
LLM improvements
05:10
12
Attention
05:28
13
Transformer Architecture
03:40
14
KV Cache
08:28
15
Paged Attention
04:38
16
Mixture Of Experts
04:01
17
Flash Attention
03:40
18
Quantization
03:33
19
Sparse Attention
05:14
20
SLM and Distillation
05:31
21
Speculative Decoding
04:58

Unlock unlimited learning

Get instant access to all 20 lessons in this course, plus thousands of other premium courses. One subscription, unlimited knowledge.

Learn more about subscription

Books

Read Book AI Engineering Course

#Title
11. Vector+Embeddings+&+Semantic+Space
22. Compression+&+Quantization_+Scaling+Vectors+Efficiently-4
33. Indexing+Techniques_+Making+Vector+Search+Scale
44. Search+Execution+Flow_+From+Query+to+Result
55. LLMs+and+RAG
66. What+is+Attention+and+Why+Does+It+Matter
77. Paged+Attention
88. Quantization+Summary

Comments

0 comments

Want to join the conversation?

Sign in to comment

Similar courses

Build Your SaaS AI Web Platform From Zero to Production

Build Your SaaS AI Web Platform From Zero to Production

Sources: Code4Startup (coderealprojects)
Discover the new trend in the world of startups and indie hackers - the creation of microservice AI-SaaS products that do more than just meet needs...
8 hours 36 minutes 2 seconds
Build a Reasoning Model (From Scratch)

Build a Reasoning Model (From Scratch)

Sources: Sebastian Raschka
Understand how LLMs reason by creating your own reasoning model from scratch. In the book "Building a Reasoning Model from Scratch," you will step by step...
5 Levels of Agents - Coding Agents

5 Levels of Agents - Coding Agents

Sources: Mckay Wrigley (takeoff)
This course teaches the creation of intelligent coding agents by going through five levels of complexity. You will learn to develop agents for review and...
5 hours 4 minutes 36 seconds
Build and Deploy a B2B SaaS AI Support Platform

Build and Deploy a B2B SaaS AI Support Platform

Sources: Code With Antonio
In this course, we will build a customer support platform powered by AI from scratch: we will set up a live chat using Convex Agents, add voice support through.
22 hours 20 minutes 55 seconds
Build AI Agents with n8n

Build AI Agents with n8n

Sources: zerotomastery.io
Learn how to create AI agents in n8n without coding. Discover how to integrate language models, configure triggers, and set up nodes for task automation.
2 hours 51 minutes 16 seconds