Skip to main content
CF

Master the First Principles of Computer Vision

2h 48m 10s
English
Paid

Master the First Principles of Computer Vision is a 33-lesson 2 hours 48 minutes self-paced course by Elliot Lee, UBICODERS. Master the fundamental principles of computer vision and build perception systems from scratch instead of using ready-made solutions.

Course facts

Lessons
33
Duration
2 hours 48 minutes
Level
All levels
Language
English
Updated
Instructor
Elliot Lee, UBICODERS
Price
Premium

Master the fundamental principles of computer vision and build perception systems from scratch instead of using ready-made solutions.

Why understanding the basics of computer vision is important

In the internet age, many engineers mistakenly believe that it's enough to simply import a YOLO model from GitHub or use a wrapper from Hugging Face to work with computer vision systems. However, such solutions often turn the engineer into a "pixel consumer," limited to two-dimensional analysis.

Problems with the approach of ready-made solutions

Problems become apparent when conditions change: lighting, camera position, or robot movement in a new environment. Without a deep understanding of spatial geometry, such systems often end up being "blind."

An image should be considered a mathematical projection of three-dimensional reality, not just a picture.

What you will learn during the course

The course offers the understanding and creation of a complete perception system from scratch. You will delve into camera geometry and 3D reconstruction of the surrounding world.

Key topics of the course

  • The connection between points in real space and image pixels.
  • Working with camera parameters (intrinsics and extrinsics).
  • How the mathematical projection model turns an image into a source of spatial data.
  • Computer vision methods for robotics: optical flow, feature matching, triangulation, and stereovision.

Building a full 3D perception pipeline

Special attention is given to building the pipeline—from camera calibration, lens distortion correction to creating depth maps and point clouds. You will learn to transform pixels into spatial knowledge and design systems that work in real conditions.

Who the course is for

The course is designed for engineers looking to go beyond using libraries and understand the mathematics of computer vision. Ideal for those who want to design perception systems for robotics, autonomous devices, and intelligent machines.

Conclusion

By mastering the principles of camera geometry and spatial analysis, you'll be able not only to apply computer vision tools but also to create system architectures that truly understand the surrounding world.

Who teaches Master the First Principles of Computer Vision?

Elliot Lee

Elliot Lee thumbnail

Hongyun “Elliot” Lee — a robotics engineer and founder of several small projects in the field of autonomous systems and robotics education. He is best known as the creator of the Ubicoders platform.

Key Facts

  • Full name: Hongyun (Elliot) Lee
  • Profession: aerospace / robotics software engineer
  • Founder of the company AIR&H Aerospace Inc.
  • Founder of the educational platform Ubicoders
  • Location: Richmond, British Columbia, Canada

UBICODERS

UBICODERS thumbnail

Ubicoders is a small educational platform focusing on robotics, autonomous systems, and computer vision. The main idea is to teach engineers to create robots and autonomous devices through mathematics, code, and simulations.

Course topics include:

  • robotics engineering
  • computer vision
  • machine learning
  • visual odometry
  • SLAM
  • ROS2
  • embedded systems
  • robot control (PID, control theory)

What lessons are included in Master the First Principles of Computer Vision?

This is a demo lesson (10:00 remaining)

You can watch up to 10 minutes for free. Subscribe to unlock all 33 lessons in this course and access 10,000+ hours of premium content across all courses.

View Pricing
0:00
/
#1: 001 Course Intro
All Course Lessons (33)
#Lesson TitleDurationAccess
1
001 Course Intro Demo
01:14
2
002 Mac VSCode Install
00:24
3
003 Ubuntu VSCode Install
00:23
4
004 Windows VSCode Install
00:31
5
005 (Optional) What is Anaconda
01:21
6
006 Mac Conda Install
01:01
7
007 Ubuntu Conda Install
00:31
8
008 Windows Conda Install
00:59
9
009 Course Environment Setup
02:37
10
010 Image as Data
08:32
11
011 Cropping The Image
06:19
12
012 Filtering the RED
09:09
13
013 Detecting the RED
06:46
14
014 Tracking the RED
06:30
15
015 The First Image Processing
07:21
16
016 Feature Descriptors and Feature Matching
13:35
17
017 Optical Flow
08:02
18
018 Camera Intrinsics Quick Dive
05:51
19
019 Camera Intrinsics Python Code
06:58
20
020 Camera Intrinsics Deeper Dive
09:14
21
021 Camera Extrinsic Parameters
09:09
22
022 Camera Extrinsic Parameters Python Code
16:01
23
023 Introduction
01:59
24
024 Camera Calibration Simple Concept
02:42
25
025 Camera Calibration Simple Concept in Python
06:08
26
026 Camera Calibration with Distortion
03:41
27
027 Camera Calibration using OpenCV Python
07:43
28
028 Stereo Camera Calibration using OpenCV Python
06:34
29
029 3D Depth Estimation - Triangulation
04:54
30
030 3D Depth Estimation - Triangulation Code
05:47
31
031 3D Depth Estimation - Disparity
01:52
32
032 3D Depth Estimation - Disparity Code
03:05
33
033 Outro
01:17
Unlock unlimited learning

Get instant access to all 32 lessons in this course, plus thousands of other premium courses. One subscription, unlimited knowledge.

Learn more about subscription

More courses by Elliot Lee

Frequently asked questions

What prerequisites should I have before enrolling in this course?
Before enrolling in this course, students should have a basic understanding of programming, particularly in Python. Familiarity with basic mathematical concepts, particularly in linear algebra and geometry, will be beneficial, as the course includes topics such as camera geometry and 3D reconstruction. Additionally, some experience with VSCode and Anaconda is helpful, as the course includes installation and setup lessons for these tools.
What kind of projects will I build during this course?
During the course, students will build a complete perception system from scratch. This includes exercises on image processing, tracking, and feature matching. Key projects involve working with camera parameters for 3D reconstruction, building a 3D perception pipeline, and implementing computer vision methods like optical flow, triangulation, and stereovision.
Who is the target audience for this course?
The course is designed for engineers and developers interested in mastering the fundamental principles of computer vision. It is particularly suitable for those who want to go beyond using ready-made solutions and wish to understand the mathematical and spatial concepts behind 3D perception systems. It is also aimed at professionals looking to apply computer vision methods to robotics and other fields.
How does this course compare to others in terms of depth and scope?
This course provides an in-depth exploration of computer vision principles, focusing on building perception systems from scratch. Unlike courses that rely on pre-built models or libraries, this course dives deeply into the mathematics of camera geometry and 3D reconstruction. It covers practical implementations, such as camera calibration and depth estimation, offering a robust foundation compared to courses that focus only on high-level API usage.
What specific tools and platforms are covered in the course?
The course includes lessons on setting up and using VSCode and Anaconda across different operating systems (Mac, Ubuntu, and Windows). Additionally, it covers the use of OpenCV for camera calibration and 3D depth estimation tasks. The course also involves coding exercises in Python, focusing on practical implementations of computer vision techniques.
What topics are not covered in this course?
The course does not cover advanced machine learning models or the use of pre-trained models from frameworks like YOLO or Hugging Face. It focuses on foundational concepts and building systems from scratch, rather than on high-level abstractions or deep learning techniques typically used in modern computer vision applications.
What is the expected time commitment for this course?
The course consists of 33 lessons, each focusing on different aspects of computer vision. While the total runtime is not specified, students should expect to spend additional time on practical exercises and coding assignments. The depth of the material suggests a significant commitment to fully grasp the concepts and apply them effectively, so dedicating a few hours per week is advisable.