Skip to main content
CF

AI for Beginners: Reasoning Models

4h 37m 14s
English
Paid

This course shows you how AI reasoning models work in clear, simple steps.You learn what they do, how they think, and where they fail.

What Reasoning Models Do

Reasoning models use a step-by-step scratchpad to solve tasks. This process is slow and careful, like human System 2 thinking. It can look like magic at first, but it is not. Here, you learn what happens inside the model as it works through a problem.

How They Form a Reasoning Chain

You study how a model builds each step in its chain of thoughts. You see how it deals with hard tasks and where it breaks down. Short hands-on tasks help you spot patterns in the model’s behavior.

How These Models Learn

You explore the training methods that shape the model’s skills. You learn how these methods guide the model toward better answers.

Reinforcement Learning

You examine how feedback helps a model improve. This includes RLHF and newer training ideas.

Reward Models and Data

You look at procedural reward models and the PRM800K dataset. You see how these tools change model behavior.

Scaling and Test-Time Compute

You learn how model size and compute at run time affect reasoning quality. This helps you guess where the field is going.

When Models Mislead You

Some models hide parts of their reasoning or act in a strategic way. You study real cases where the model gives false paths or masks its inner steps.

How to Spot Problems

You learn simple checks to catch these issues. These skills help you judge if the model’s answer is sound or risky.

Additional

Course HandBook - https://half-money-bd8.notion.site/Course-Handbook-6234be19ffcd4e02991fa7c5227d21b3

About the Author: Zero To Mastery

Zero To Mastery thumbnail

Zero To Mastery (ZTM) is a Toronto-based online coding academy founded by Andrei Neagoie, originally a senior developer at large Canadian tech firms before turning to teaching full-time. The academy's signature is the cohort-based bootcamp track combined with a deep self-paced course library, all aimed at career-changers and self-taught developers preparing to land software-engineering roles at top companies.

The instructor roster has grown well beyond Andrei to include other senior practitioners: Daniel Bourke (machine learning), Aleksa Tešić (DevOps), Jacinto Wong, and others. Courses cover the full software-engineering career path: web development with React and Next.js, Python, machine learning and deep learning, DevOps and cloud, system design, mobile, and the algorithm / data-structure interview prep that gates engineering jobs.

The CourseFlix listing under this source carries over 120 ZTM courses spanning that full range. Material is paid; ZTM itself runs on a monthly / annual membership model. The teaching style favours long-form, project-based courses where students build complete portfolio-quality applications rather than disconnected feature tutorials.

Watch Online 31 lessons

This is a demo lesson (10:00 remaining)

You can watch up to 10 minutes for free. Subscribe to unlock all 31 lessons in this course and access 10,000+ hours of premium content across all courses.

View Pricing
0:00
/
#1: Introduction
All Course Lessons (31)
#Lesson TitleDurationAccess
1
Introduction Demo
03:41
2
Replay: Chain-of-Thought Prompting - Part 1
05:25
3
Replay: Chain-of-Thought Prompting - Part 2
05:45
4
Introduction to Reasoning Models
09:24
5
First Contact with Reasoning
16:48
6
Secrets and Lies!
12:15
7
Setting Up Our Open Source Reasoning Model
05:52
8
A Reasoning Model's Real Thoughts - Part 1
05:16
9
A Reasoning Model's Real Thoughts - Part 2
08:41
10
Thinking Like LLMs - Breaking The Chains
12:16
11
What Are Reasoning Models Good For? (The Generator-Verifier Gap)
13:33
12
Exercise: Determine GVG
10:08
13
Prompt Engineering for Reasoning Models
07:28
14
Context Engineering
18:20
15
Thinking Like LLMs: Cats Are...Confusing? - Part 1
10:22
16
Thinking Like LLMs: Cats Are...Confusing? - Part 2
07:11
17
Reinforcement Learning - The Problem
06:21
18
Reinforcement Learning - How It Works
15:03
19
RL Environments (Soccer)
04:19
20
RL Environments (Go)
07:47
21
Reinforcement Learning from Human Feedback (RLHF)
16:07
22
Reinforcement Learning for Reasoning Models - Let's Verify Step-By-Step
06:36
23
Reinforcement Learning for Reasoning Models - Process Reward Model
09:28
24
PRM800K Introduction
07:41
25
PRM800K Deep Dive
13:12
26
Test-Time Compute
12:41
27
Are Reasoning Models Lying To You? - Part 1
11:08
28
Are Reasoning Models Lying To You? - Part 2
02:43
29
Are Reasoning Models Lying To You? - Part 3
07:54
30
Are Reasoning Models Lying To You? - Part 4
02:52
31
Let's Keep Learning Together!
00:57
Unlock unlimited learning

Get instant access to all 30 lessons in this course, plus thousands of other premium courses. One subscription, unlimited knowledge.

Learn more about subscription

Books

Read Book AI for Beginners: Reasoning Models

#TitleTypeOpen
1Reasoning & The Context Window PDF
2Mandatory Homework The Thinking Game PDF
3Exercise Compare Reasoning Style of Different Models PDF
4Exerciseï Code Your Own Maze Game PDF

Related courses

  • Build a DeepSeek Model (From Scratch) thumbnailNew

    Build a DeepSeek Model (From Scratch)

    By: Rajat Dandekar, Naman Dwivedi, Dr. Sreedath Pana
    Learn how to build a DeepSeek model from scratch. A practical guide with a focus on engineering and algorithmic solutions for efficient model performance.
  • Vibe Code a Generative AI Finance App with Python and LangChain thumbnailNew

    Vibe Code a Generative AI Finance App with Python and LangChain

    By: Zero To Mastery
    Master the creation of AI applications for investments using Python and LangChain. Practice developing a fintech application and understanding financial metrics
    7h 36m5/5
  • AI Voice Agents with AWS thumbnailNew

    AI Voice Agents with AWS

    By: Zero To Mastery
    Study the creation of voice AI agents using AWS and Python. Develop an assistant with real functionalities and a deep understanding of the architecture.
    3h 1m5/5

Frequently asked questions

What is AI for Beginners: Reasoning Models about?
This course shows you how AI reasoning models work in clear, simple steps. You learn what they do, how they think, and where they fail. What Reasoning Models Do Reasoning models use a step-by-step scratchpad to solve tasks. This process is…
Who teaches this course?
It is taught by Zero To Mastery. You can find more courses by this instructor on the corresponding source page.
How long is the course?
It contains 31 lessons with a total runtime of 4 hours 37 minutes. Every lesson is available to watch online at your own pace.
Is it free to watch?
It is part of CourseFlix's premium catalog. A subscription unlocks the full video player; the course description, table of contents, and preview information are available to everyone.
Where can I watch it online?
The course is available to watch online on CourseFlix at https://courseflix.net/course/ai-for-beginners-reasoning-models. The page hosts every lesson with the integrated video player; no download is required.