cmu-vision.github.io/courses.html at main · cmu-vision/cmu-vision.github.io · GitHub

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
<!DOCTYPE html>
<html lang="en">
<head>
  <meta charset="UTF-8">
  <meta name="viewport" content="width=device-width, initial-scale=1.0">
  <title>Courses — Computer Vision @ CMU</title>
  <link rel="icon" href="./assets/ri-favicon.ico">
  <link rel="preconnect" href="https://fonts.googleapis.com">
  <link rel="preconnect" href="https://fonts.gstatic.com" crossorigin>
  <link href="https://fonts.googleapis.com/css2?family=Inter:wght@400;500;600;700&display=swap" rel="stylesheet">
  <link rel="stylesheet" href="./css/style.css">
</head>
<body>

  <!-- Navigation -->
  <nav class="nav">
    <div class="nav__inner">
      <a href="./index.html" class="nav__logo">
        <img src="./assets/logo.svg" alt="Computer Vision @ Carnegie Mellon">
      </a>
      <button class="nav__hamburger" aria-label="Toggle menu" aria-expanded="false">
        <span></span><span></span><span></span>
      </button>
      <div class="nav__links">
        <a href="./index.html">Overview</a>
        <a href="./people.html">People</a>
        <a href="./research.html">Research</a>
        <a href="./papers.html">Papers</a>
        <a href="./courses.html" class="active">Courses</a>
        <a href="./sponsorship.html">Sponsorship</a>
      </div>
    </div>
  </nav>

  <!-- Page Header -->
  <div class="page-header">
    <div class="container">
      <h1 class="page-header__title">Courses</h1>
      <p class="page-header__subtitle">Graduate and undergraduate courses in computer vision, computational imaging, robotics, and related fields.</p>
    </div>
  </div>

  <!-- Course List -->
  <section class="section">
    <div class="container">
      <div class="grid--courses">
        <!-- autogen courses -->
<div class="card card--course">
  <img class="card--course__thumb" src="courses/15463.png" alt="Computational Photography">
  <div class="card--course__body">
    <div class="card--course__number">15-463, 15-663, 15-862</div>
    <h3 class="card--course__title">Computational Photography</h3>
    <p class="card--course__desc">Computational photography is the convergence of computer graphics, computer vision, optics and imaging. Its role is to overcome the limitations of traditional cameras, by combining imaging and computation to enable new and enhanced ways of capturing, representing, and interacting with the physical world. This advanced undergraduate course provides a comprehensive overview of the state of the art in computational photography. At the start of the course, we will study modern image processing pipelines, including those encountered on mobile phone and DSLR cameras, and advanced image and video editing algorithms. Then we will continue to learn about the physical and computational aspects of tasks such as 3D scanning, coded photography, lightfield imaging, time-of-flight imaging, VR/AR displays, and computational light transport. Near the end of the course, we will discuss active research topics, such as creating cameras that capture video at the speed of light, cameras that look around walls, or cameras that can see below skin.</p>
    <a class="card--course__link" href="http://graphics.cs.cmu.edu/courses/15-463/">Course Website &rarr;</a>
  </div>
</div>
<div class="card card--course">
  <img class="card--course__thumb" src="courses/15468.jpg" alt="Physics-based Rendering">
  <div class="card--course__body">
    <div class="card--course__number">15-468, 15-668, 15-868</div>
    <h3 class="card--course__title">Physics-based Rendering</h3>
    <p class="card--course__desc">This course is an introduction to physics-based rendering at the advanced undergraduate and introductory graduate level. During the course, we will cover fundamentals of light transport, including topics such as the rendering and radiative transfer equation, light transport operators, path integral formulations, and approximations such as diffusion and single scattering. Additionally, we will discuss state-of-the-art models for illumination, surface and volumetric scattering, and sensors. Finally, we will use these theoretical foundations to develop Monte Carlo algorithms and sampling techniques for efficiently simulating physically-accurate images. Towards the end of the course, we will look at advanced topics such as rendering wave optics, neural rendering, and differentiable rendering.</p>
    <a class="card--course__link" href="http://graphics.cs.cmu.edu/courses/15-468/">Course Website &rarr;</a>
  </div>
</div>
<div class="card card--course">
  <img class="card--course__thumb" src="courses/16385.png" alt="Computer Vision">
  <div class="card--course__body">
    <div class="card--course__number">16-385</div>
    <h3 class="card--course__title">Computer Vision</h3>
    <p class="card--course__desc">This course provides a comprehensive introduction to computer vision. Major topics include image processing, detection and recognition, geometry-based and physics-based vision and video analysis. Students will learn basic concepts of computer vision as well as hands on experience to solve real-life vision problems.</p>
    <a class="card--course__link" href="http://16385.courses.cs.cmu.edu/spring2022/">Course Website &rarr;</a>
  </div>
</div>
<div class="card card--course">
  <img class="card--course__thumb" src="courses/16385.png" alt="Computer Vision">
  <div class="card--course__body">
    <div class="card--course__number">16-720B</div>
    <h3 class="card--course__title">Computer Vision</h3>
    <p class="card--course__desc">This course introduces the fundamental techniques used in computer vision, that is, the analysis of patterns in visual images to reconstruct and understand the objects and scenes that generated them. Topics covered include image processing basics, Hough Transforms, feature detection, feature descriptors, image representations, image classification and object detection. We will also cover camera geometry, multi-view geometry, stereo, 3D reconstruction from images, optical flow, motion analysis and tracking. Version B of 16-720 is intended for students with prior knowledge of computer vision and prior exposure to machine learning. Undergraduate students should take 16-385 which is the undergraduate version of the class.</p>
    <a class="card--course__link" href="https://kriskitani.github.io/courses/16720B/">Course Website &rarr;</a>
  </div>
</div>
<div class="card card--course">
  <img class="card--course__thumb" src="courses/16720A.png" alt="Computer Vision">
  <div class="card--course__body">
    <div class="card--course__number">16-720A</div>
    <h3 class="card--course__title">Computer Vision</h3>
    <p class="card--course__desc">This course introduces the fundamental techniques used in computer vision, that is, the analysis of patterns in visual images to reconstruct and understand the objects and scenes that generated them. The first third of the course covers low-level image processing, including filtering, warping, image descriptors, and correspondence matching. The second third of the course covers geometry and 3D motion, including image formation, camera models, optical flow, stereo, and structure from motion. The last third of the course covers pattern recognition including deep learning, convolutional neural networks. Additional topics include radiometry, color, and photometric stereo. Prerequisites include linear algebra, probabiliity, and calculus. Courses related to 16-720A include 16-385 and 16-720B.  Undergraduates should take 16-385, which serves as the undergraduate version of this course). Graduate students with little exposure to computer vision should take 16-720A, which serves as the introductory graduate version of this course. Graduate students with prior exposure to computer vision should take 16-720B, which serves as the advanced version of this course.</p>
    <a class="card--course__link" href="https://canvas.cmu.edu/courses/30701">Course Website &rarr;</a>
  </div>
</div>
<div class="card card--course">
  <img class="card--course__thumb" src="https://www.cs.cmu.edu/~junyanz/imgs/16726.jpg" alt="Learning-Based Image Synthesis">
  <div class="card--course__body">
    <div class="card--course__number">16-726</div>
    <h3 class="card--course__title">Learning-Based Image Synthesis</h3>
    <p class="card--course__desc">This course introduces machine learning methods for image and video synthesis. The objectives of synthesis research vary from modeling statistical distributions of visual data, through realistic picture-perfect recreations of the world in graphics, and all the way to providing interactive tools for artistic expression. Key machine learning algorithms will be presented, ranging from classical learning methods (e.g., nearest neighbor, PCA) to deep learning models (e.g., ConvNets, NeRF, deep generative models, including GANs, VAEs, autoregressive models, and diffusion models). Finally, we will discuss image and video forensics methods for detecting synthetic content. Students will learn to build practical applications and create new visual effects using their own photos and videos.</p>
    <a class="card--course__link" href="https://learning-image-synthesis.github.io/">Course Website &rarr;</a>
  </div>
</div>
<div class="card card--course">
  <img class="card--course__thumb" src="https://geometric3d.github.io/data/teaserim.png" alt="Geometry-based Methods in Vision">
  <div class="card--course__body">
    <div class="card--course__number">16-822</div>
    <h3 class="card--course__title">Geometry-based Methods in Vision</h3>
    <p class="card--course__desc">The course focuses on the geometric aspects of computer vision: The geometry of image formation and its use for 3D reconstruction and calibration. The objective of the course is to introduce the formal tools and results that are necessary for developing multi-view reconstruction algorithms. The fundamental tools introduced study affine and projective geometry, which are essential to the development of image formation models. These tools are then used to develop formal models of geometric image formation for a single view (camera model), two views (fundamental matrix), and three views (trifocal tensor); 3D reconstruction from multiple images; auto-calibration; and learning based methods.</p>
    <a class="card--course__link" href="https://geometric3d.github.io/">Course Website &rarr;</a>
  </div>
</div>
<div class="card card--course">
  <img class="card--course__thumb" src="courses/16823.png" alt="Physics based Methods in Vision">
  <div class="card--course__body">
    <div class="card--course__number">16-823</div>
    <h3 class="card--course__title">Physics based Methods in Vision</h3>
    <p class="card--course__desc">Everyday, we observe an extraordinary array of light and color phenomena around us, ranging from the dazzling effects of the atmosphere, the complex appearances of surfaces and materials, and underwater scenarios. For a long time, artists, scientists, and photographers have been fascinated by these effects, and have focused their attention on capturing and understanding these phenomena. In this course, we take a computational approach to modeling and analyzing these phenomena, which we collectively call &quot;visual appearance&quot;. The first half of the course focuses on the physical fundamentals of visual appearance, while the second half of the course focuses on algorithms and applications in a variety of fields such as computer vision, graphics and remote sensing and technologies such as underwater and aerial imaging. This course unifies concepts usually learnt in physical sciences and their application in imaging sciences. Students attending this course will learn about the fundamental building blocks that describe visual appearance, and recent academic papers on a variety of physics-based methods that measure, process, and analyze visual information from the real world.</p>
    <a class="card--course__link" href="https://www.cs.cmu.edu/~motoole2/16823-s20/">Course Website &rarr;</a>
  </div>
</div>
<div class="card card--course">
  <img class="card--course__thumb" src="https://visual-learning.cs.cmu.edu/images/teaser.jpg" alt="Visual Learning and Recognition">
  <div class="card--course__body">
    <div class="card--course__number">16-824</div>
    <h3 class="card--course__title">Visual Learning and Recognition</h3>
    <p class="card--course__desc">This graduate-level computer vision course focuses on representation and reasoning for large amounts of data (images, videos, and associated tags, text, GPS locations, etc.) toward the ultimate goal of understanding the visual world surrounding us. We will be reading an eclectic mix of classic and recent papers on topics including Theories of Perception, Mid-level Vision (Grouping, Segmentation, Poses), Object and Scene Recognition, 3D Scene Understanding, Action Recognition, Contextual Reasoning, Joint Language and Vision Models, Deep Generative Models, etc. We will be covering a wide range of supervised, semi-supervised and unsupervised approaches for each of the topics above.</p>
    <a class="card--course__link" href="https://visual-learning.cs.cmu.edu/">Course Website &rarr;</a>
  </div>
</div>
<div class="card card--course">
  <img class="card--course__thumb" src="courses/l43d.png" alt="Learning for 3D Vision">
  <div class="card--course__body">
    <div class="card--course__number">16-825</div>
    <h3 class="card--course__title">Learning for 3D Vision</h3>
    <p class="card--course__desc">Any autonomous agent we develop must perceive and act in a 3D world. The ability to infer, model, and utilize 3D representations is therefore of central importance in AI, with applications ranging from robotic manipulation and self-driving to virtual reality and image manipulation. While 3D understanding has been a longstanding goal in computer vision, it has witnessed several impressive advances due to the rapid recent progress in (deep) learning techniques. The goal of this course is to explore this confluence of 3D Vision and Learning-based methods.</p>
    <a class="card--course__link" href="https://learning3d.github.io/">Course Website &rarr;</a>
  </div>
</div>
<div class="card card--course">
  <img class="card--course__thumb" src="courses/16833.png" alt="Robot Localization and Mapping">
  <div class="card--course__body">
    <div class="card--course__number">16-833</div>
    <h3 class="card--course__title">Robot Localization and Mapping</h3>
    <p class="card--course__desc">This course focuses on the optimization aspects of state estimation, localization, and mapping. Localization and mapping are fundamental capabilities for mobile robots operating in the real world. Even more challenging than these individual problems is their combination: simultaneous localization and mapping (SLAM). Robust and scalable solutions are needed that can handle the uncertainty inherent in sensor measurements, while providing localization and map estimates in real-time. We will investigate suitable efficient probabilistic inference algorithms at the intersection of linear algebra and probabilistic graphical models. We will also explore some state-of-the-art systems.</p>
    <a class="card--course__link" href="https://www.cs.cmu.edu/~kaess/teaching/16833/">Course Website &rarr;</a>
  </div>
</div>
      </div>
    </div>
  </section>

  <!-- Footer -->
  <footer class="footer">
    <div class="footer__inner">
      <div class="footer__brand">
        <img src="./assets/cmu.svg" alt="Carnegie Mellon University">
        <p>School of Computer Science<br>5000 Forbes Ave<br>Pittsburgh, PA 15213</p>
      </div>
      <div class="footer__links">
        <h4>Pages</h4>
        <a href="./index.html">Overview</a>
        <a href="./people.html">People</a>
        <a href="./research.html">Research</a>
        <a href="./papers.html">Papers</a>
        <a href="./courses.html">Courses</a>
        <a href="./sponsorship.html">Sponsorship</a>
      </div>
    </div>
    <div class="footer__bottom">
      &copy; Carnegie Mellon University. All rights reserved.
    </div>
  </footer>

  <script src="./js/nav.js"></script>
</body>
</html>