Course Contents
This course dives into advanced concepts in computer vision. A first focus is geometry in computer vision, including image formation, represnetation theory for vision, classic multi-view geometry, multi-view geometry in the age of deep learning, differentiable rendering, neural scene representations, correspondence estimation, optical flow computation, and point tracking.
Next, we explore generative modeling and representation learning including image and video generation, guidance in diffusion models, conditional probabilistic models, as well as representation learning in the form of contrastive and masking-based methods.
Finally, we will explore the intersection of robotics and computer vision with "vision for embodied agents", investigating the role of vision for decision-making, planning and control.
Schedule
Tuesday |  –  |
Thursday |  –  |
Syllabus
Module 0: Introduction to Computer Vision | ||
---|---|---|
Introduction to Vision |
| |
Module 1: Module 1: Geometry, 3D and 4D | ||
What is an Image: Pinhole Cameras & Projective Geometry |
| |
Linear Image Processing & Transformations |
| |
Representation Theory in Vision |
| |
No Class (Monday Schedule)holiday | ||
Geometric Deep Learning (or the lack thereof) for Vision |
| |
Correspondence, Optical Flow, and Scene Flow |
| |
Correspondence, Optical Flow, and Scene Flow 2 |
| |
Multi-View Geometry 1 |
| |
Data Structures and Signal Parameterizations |
| |
Differentiable Rendering & Novel View Synthesis |
| |
Differentiable Rendering & Novel View Synthesis 2 |
| |
Prior-Based 3D Reconstruction and Novel View Synthesis |
| |
Open Problems in Geometry, 3D, and 4D |
| |
Student Holiday: Spring Breakholiday | ||
Student Holiday: Spring Breakholiday | ||
Module 2: Module 2: Unsupervised Representation Learning and Generative Modeling | ||
Introduction to Representation Learning and Generative Modeling |
| |
Latent Variable Models and VAEs |
| |
Diffusion Models |
| |
Diffusion Models 2 |
| |
Sequence Generative Models |
| |
Bridging Domain Gaps |
| |
Non-Generative Representation Learning |
| |
Open Problems in Representation Learning |
| |
Module 3: Module 3: Vision for Embodied Agents | ||
Introduction to Robotic Perception |
| |
Sequence Generative Modeling for Decision-Making |
| |
Vision for Inverse Kinematics and State Estimation |
|