AI Engineering Bootcamp: RAG (Retrieval Augmented Generation) for LLMs
17h 51m 59s
English
Paid
This course shows you how to build smarter AI apps with RAG. You use RAG to give LLMs fresh facts from your own data. This helps you make tools like chatbots, search apps, and simple analysis systems.
Why RAG matters
LLMs learn from old training data. This can limit their answers. RAG helps you fix this by pulling new facts from files, links, and other sources. You then pass these facts to the model. This leads to clear and useful results in real tasks.
Example: A shop chatbot can check live stock data. It does not guess based on old samples. It gives you clear info about items and delivery dates.
What you learn
Retrieval basics
How to clean and prepare text for search
How Boolean, vector, and simple probabilistic search work
How indexing, queries, and ranking work
Generative model basics
How transformers and attention work
How to prepare data and train text models
Intro to RAG
How search and generation work together
How RAG helps in real projects
Using the OpenAI API
How to set up the API and write clear prompts
How model settings change output
Building RAG with OpenAI
How to build a simple end‑to‑end RAG system
How to link search results with model output
Working with unstructured data
How to read data from PDF, Word, PowerPoint, Excel, and images
How to pull useful text from mixed formats
Multimodal RAG
How to use both text and images in one flow
How to merge data types into one answer
Agent systems with RAG
How to build simple agents that talk to users and run tasks
How to track agent state and run steps in order
Why this course helps you
You get hands-on practice with key RAG steps. You learn how to build tools that use fresh data, handle wide search tasks, and give clear answers. These skills help you build strong AI apps for real work.
Zero To Mastery (ZTM) is a Toronto-based online coding academy founded by Andrei Neagoie, originally a senior developer at large Canadian tech firms before turning to teaching full-time. The academy's signature is the cohort-based bootcamp track combined with a deep self-paced course library, all aimed at career-changers and self-taught developers preparing to land software-engineering roles at top companies.
The instructor roster has grown well beyond Andrei to include other senior practitioners: Daniel Bourke (machine learning), Aleksa Tešić (DevOps), Jacinto Wong, and others. Courses cover the full software-engineering career path: web development with React and Next.js, Python, machine learning and deep learning, DevOps and cloud, system design, mobile, and the algorithm / data-structure interview prep that gates engineering jobs.
The CourseFlix listing under this source carries over 120 ZTM courses spanning that full range. Material is paid; ZTM itself runs on a monthly / annual membership model. The teaching style favours long-form, project-based courses where students build complete portfolio-quality applications rather than disconnected feature tutorials.
Watch Online 171 lessons
This is a demo lesson (10:00 remaining)
You can watch up to 10 minutes for free. Subscribe to unlock all 171 lessons in this course and access 10,000+ hours of premium content across all courses.
Unlock the potential of Retrieval-Augmented Generation (RAG) as you delve into this comprehensive course designed to equip you with the skills to create.
By: Vue School, Justin Schroeder, Daniel Kelly, Garrison Snelling
Study the RAG approach to enhance AI with your own data. Learn about vectors, embeddings, and integration. Apply the approach in real projects.
26m
Frequently asked questions
What prerequisites are needed before taking this course?
Before enrolling in this course, students should have a basic understanding of Python programming, as it is used extensively in the lessons. Familiarity with basic concepts of machine learning and AI models will also be beneficial, particularly when covering topics like transformers and attention mechanisms. Some lessons address API usage, so prior experience with APIs would be advantageous but not strictly necessary.
What type of projects will I build during the course?
Throughout the course, students will engage in practical projects including building a LinkedIn Post Writer App and a Mini Rubber Ducky AI. These projects incorporate the principles of Retrieval Augmented Generation (RAG) by integrating real-time data retrieval with generative model outputs. Students also learn to design user interfaces and deploy applications using tools like the OpenAI API.
Who is the target audience for this course?
The course is designed for aspiring AI engineers and developers who want to create smarter AI applications using Retrieval Augmented Generation. It is suitable for those interested in building applications like chatbots and search systems that require real-time data integration. The course also caters to individuals looking to enhance their skills in using the OpenAI API for practical applications.
How does this course compare in scope to other AI courses?
This course focuses specifically on RAG for LLMs, blending retrieval techniques with generative models to enhance AI application performance. Unlike more generalized AI courses, it covers niche topics such as multimodal RAG and working with unstructured data formats. It goes beyond basic AI concepts to offer hands-on experience with the OpenAI API and building end-to-end RAG systems.
What tools and platforms will I learn to use?
The course extensively covers the OpenAI API, teaching students how to set it up, write effective prompts, and configure model settings. Other tools include vector stores for efficient data retrieval and various programming exercises to integrate data from formats like PDF, Word, and Excel. Students also learn to use image processing techniques within the RAG framework.
What topics are not covered in this course?
The course does not cover deep learning model architecture design from scratch or advanced mathematical foundations of machine learning algorithms. It focuses on practical applications of RAG and does not delve into the development of custom AI models or training large datasets using frameworks like TensorFlow or PyTorch.
How much time should I expect to commit to this course?
The course comprises 171 lessons, each designed to progressively build on the previous one. While the exact runtime is not specified, students should be prepared to commit several hours each week to fully engage with the material, complete practical exercises, and participate in project work. The time commitment will vary based on individual learning pace and prior experience.