VVZ API is not affiliated with ETH Zurich. Data might be outdated or incorrect. Please view the official ETHZ Vorlesungsverzeichnis for binding information.
Machine Perception
Last Updated: 2026-06-01 11:33:07
Abstract
Recent developments in neural networks have drastically advanced the performance of machine perception systems in a variety of areas including computer vision, robotics, and human shape modeling.This course is a deep dive into deep learning algorithms and architectures with applications to a variety of perceptual and generative tasks.
Objective
Students will learn about fundamental aspects of modern deep learning approaches for perception and generation. Students will learn to implement, train and debug their own neural networks and gain a detailed understanding of cutting-edge research in learning-based computer vision, robotics, and shape modeling. The optional final project assignment will involve training a complex neural network architecture and applying it to a real-world dataset. The core competency acquired through this course is a solid foundation in deep-learning algorithms to process and interpret human-centric signals. In particular, students should be able to develop systems that deal with the problem of recognizing people in images, detecting and describing body parts, inferring their spatial configuration, performing action/gesture recognition from still images or image sequences, also considering multi-modal data, among others.
Content
The courses focuses on teaching how to set up the problem of machine perception and the associated learning algorithms, neural network architectures, and advanced deep learning concepts. The course covers the following main areas: I) Foundations of Deep Learning: Multilayer perceptrons, backpropagation, time-series modeling, convolutional neural networks. II) Advanced topics: latent variable models, generative adversarial networks, auto-regressive models, invertible neural networks, normalizing flows, diffusion models, neural implicit surface representations, neural radiance fields. III) Applications in machine perception and human-centric computer vision: general understanding of human activities, 3D reconstruction of human performance using different input modalities (monocular or multi-view images, body-worn sensors) and representations (explicit triangulated meshes, parametric body models, implicit surfaces, neural radiance fields, 3D Gaussian Splatting-based), Deep Reinforcement Learning and applications in physics-based behavior modeling.
Resources
Literature
Deep Learning Book by Ian Goodfellow, Yoshua Bengio, and Aaron Courville
Learning Materials (Links)
- Main link
- Information
General Information
- Language
- English
- Levels
- MSC , WBZ
- Frequency
- Yearly recurring
Examination
- Type
- end-of-semester examination
- Mode
- written 180 minutes
- Aids
- No electronic devices and calculators are allowed. A cheat sheet with the following restrictions is allowed: a maximum of 4 DIN A4 pages are allowed. They may be distributed over 2 double-sided sheets of paper or 4 sheets of paper written only on one side. Notes can be written digitally. It is not allowed to paste images or similar into the notes (both physical or digital) and making digital notes excessively small. If digital notes are typeset, the font size must be no smaller than 10 pt.
Registration & Places
- Max Places
- 300
Course Components
| Type | Title | Time & Place | Hours |
|---|---|---|---|
| lecture | Machine Perception |
|
3 h weekly |
| exercise | Machine Perception |
|
2 h weekly |
| independent project | Machine Perception | No time listed | 2 h weekly |
Offered In
-
-
-
-
Anwendungsgebiet (Nur für das Master-Diplom in Angewandter Mathematik erforderlich und anrechenbar. In der Kategorie Anwendungsgebiet für den Master in Angewandter Mathematik muss eines der zur Auswahl stehenden Anwendungsgebiete gewählt werden. Im gewählten Anwendungsgebiet müssen mindestens 8 KP erworben werden. Kreditpunkte aus anderen Anwendungsgebieten sind nicht für weitere Anwendungsgebiete anrechenbar.)
-
-
-
-
Vertiefung: Signal Processing and Machine Learning (The core courses and specialization courses below are a selection for students who wish to specialize in the area of "Signal Processing and Machine Learning ", see . The individual study plan is subject to the tutor's approval.)
-
Vertiefungsfächer (These specialization courses are particularly recommended for the area of "Signal Processing and Machine Learning", but you are free to choose courses from any other field in agreement with your tutor. Semester / Research Projects are not allowed in this category. A minimum of 40 credits must be obtained from specialization courses during the MSc EEIT.)
-
-
-
-
-
-
-
-
-
-
-
-
-
Fachspezifische Vertiefung (Es müssen mindestens 20 KP aus den Deep Track Lerneinheiten absolviert werden. Überzählige KP können für Wahlfächer angerechnet werden.)
-
-
Vertiefungsfächer Robotics (Diese LE's können sowohl als Vertiefungsfach als auch als Wahlfach angerechnet werden.)
-
-
-