AI Engineering: Customizing LLMs for Business (Fine-Tuning LLMs with QLoRA & AWS)

Name: AI Engineering: Customizing LLMs for Business (Fine-Tuning LLMs with QLoRA & AWS)
Price: 9 USD
Availability: InStock

7h 12m 10s

English

Paid

October 18, 2025

Course description

Master an in-demand skill that companies are looking for: developing and implementing custom LLMs. In this course, you will learn how to fine-tune open large language models on closed/corporate data and deploy your models using AWS (SageMaker, Lambda, API Gateway) and Streamlit to provide a convenient interface for employees and clients.

This is not "just another introductory AI course." It is a practical deep dive into the skills that set AI engineers apart on real projects. You will perform fine-tuning using QLoRA, a method that drastically reduces resource consumption, and then turn the model into a production service.

Watch Online

0:00

#1: Course Introduction (What We're Building)

All Course Lessons (58)

#	Lesson Title	Duration
1	Course Introduction (What We're Building) Demo	05:20
2	Signing in to AWS	04:31
3	Creating an IAM User	05:30
4	Using our new IAM User	03:13
5	What To Do In Case You Get Hacked!	01:31
6	Creating a SageMaker Domain	02:29
7	Logging in to our SageMaker Environment	04:54
8	Introduction to JupyterLab	07:38
9	Sagemaker Sessions, Regions, and IAM Roles	07:51
10	Examining Our Dataset from HuggingFace	13:30
11	Tokenization and Word Embeddings	09:09
12	HuggingFace Authentication with Sagemaker	04:22
13	Applying the Templating Function to our Dataset	08:44
14	Attention Masks and Padding	15:56
15	Star Unpacking with Python	04:04
16	Chain Iterator, List Constructor and Attention Mask example with Python	10:23
17	Understanding Batching	08:12
18	Slicing and Chunking our Dataset	07:32
19	Creating our Custom Chunking Function	16:07
20	Tokenizing our Dataset	09:31
21	Running our Chunking Function	04:31
22	Understanding the Entire Chunking Process	08:33
23	Uploading the Training Data to AWS S3	05:54
24	Setting Up Hyperparameters for the Training Job	06:48
25	Creating our HuggingFace Estimator in Sagemaker	06:46
26	Introduction to Low-rank adaptation (LoRA)	08:12
27	LoRA Numerical Example	10:56
28	LoRA Summarization and Cost Saving Calculation	09:09
29	(Optional) Matrix Multiplication Refresher	04:46
30	Understanding LoRA Programatically Part 1	12:33
31	Understanding LoRA Programatically Part 2	05:49
32	Bfloat16 vs Float32	08:11
33	Comparing Bfloat16 Vs Float32 Programatically	06:33
34	Setting up Imports and Libraries for the Train Script	07:20
35	Argument Parsing Function Part 1	07:57
36	Argument Parsing Function Part 2	10:55
37	Understanding Trainable Parameters Caveats	14:31
38	Introduction to Quantization	07:36
39	Identifying Trainable Layers for LoRA	07:20
40	Setting up Parameter Efficient Fine Tuning	04:36
41	Implement LoRA Configuration and Mixed Precision Training	10:35
42	Understanding Double Quantization	04:22
43	Creating the Training Function Part 1	14:15
44	Creating the Training Function Part 2	07:17
45	Exercise: Imposter Syndrome	02:57
46	Finishing our Sagemaker Script	05:09
47	Gaining Access to Powerful GPUs with AWS Quotas	05:11
48	Final Fixes Before Training	03:55
49	Starting our Training Job	07:16
50	Inspecting the Results of our Training Job and Monitoring with Cloudwatch	11:24
51	Deploying our LLM to a Sagemaker Endpoint	17:58
52	Testing our LLM in Sagemaker Locally	08:19
53	Creating the Lambda Function to Invoke our Endpoint	08:56
54	Creating API Gateway to Deploy the Model Through the Internet	02:37
55	Implementing our Streamlit App	05:12
56	Streamlit App Correction	03:27
57	Congratulations and Cleaning up AWS Resources	02:39
58	Thank You!	01:18

Unlock unlimited learning

Get instant access to all 57 lessons in this course, plus thousands of other premium courses. One subscription, unlimited knowledge.

Learn more about subscription

Comments

0 comments

Want to join the conversation?

Similar courses

RAG: Beyond Basics

Sources: Prompt Engineering

The course is dedicated to the practical and theoretical study of Retrieval-Augmented Generation (RAG). You will learn not only "how" but also "why" these...

2 hours 40 minutes 48 seconds

Build an End-to-End Web App from Scratch in AWS

Sources: zerotomastery.io

Learn to design and build a real web application with AWS. This project-based course will guide you through using five AWS services to create something you can

31 minutes 54 seconds

AWS Certified Developer - Associate

Sources: Adrian Cantrill

The AWS Certified Developer Associate is one of the most valuable and in-demand cloud certifications, part of the AWS Associate (All-3) series. It covers all of the core AWS ser...

68 hours 8 minutes 48 seconds

Building LLMs for Production

Sources: Towards AI, Louis-François Bouchard

"Creating LLM for Production" is a practical guide spanning 470 pages (updated in October 2024), designed for developers and specialists...