VVZ API is not affiliated with ETH Zurich. Data might be outdated or incorrect. Please view the official ETHZ Vorlesungsverzeichnis for binding information.

263-5255-00L 5 Credits MSC, WBZ D-ITET, D-INFK, D-MATH

Foundations of Reinforcement Learning

Lecturers & Examiners: Prof. Dr. Niao He

Number of participants limited to 190. Last cancellation/deregistration date for this graded semester performance: Thursday, 28 October 2021! Please note that after that date no deregistration will be accepted and the course will be considered as "fail".

VVZ CR 2.29

Last Updated: 2026-02-05 15:48:25

Abstract

Reinforcement learning (RL) has been in the limelight of many recent breakthroughs in artificial intelligence. This course focuses on theoretical and algorithmic foundations of reinforcement learning, through the lens of optimization, modern approximation, and learning theory. The course targets M.S. students with strong research interests in reinforcement learning, optimization, and control.

Objective

This course aims to provide students with an advanced introduction to RL theory and algorithms and to bring them near the frontier of this active research field. By the end of the course, students will be able to:
- Identify the strengths and limitations of various reinforcement learning algorithms;
- Formulate and solve sequential decision-making problems by applying relevant reinforcement learning tools;
- Generalize or discover “new” applications, algorithms, or theories of reinforcement learning towards conducting independent research on the topic.

Content

Basic topics include fundamentals of Markov decision processes, approximate dynamic programming, linear programming and primal-dual perspectives of RL, model-based and model-free RL, policy gradient and actor-critic algorithms, and Markov games and multi-agent RL. If time allows, we will also discuss advanced topics such as batch RL, inverse RL, and causal RL. The course places a strong emphasis on an in-depth understanding of the mathematical modeling and theoretical properties of RL algorithms.

Resources

Lecture Notes

Lecture notes will be posted on Moodle.

Literature

- Dynamic Programming and Optimal Control, Vol. I & II, Dimitri Bertsekas.
- Reinforcement Learning: An Introduction, Second Edition, Richard Sutton and Andrew Barto.
- Algorithms for Reinforcement Learning, Csaba Szepesvári.
- Reinforcement Learning: Theory and Algorithms, Alekh Agarwal, Nan Jiang, Sham M. Kakade.

Learning Materials (Links)

Main link
Information

General Information

Language: English
Levels: MSC, WBZ
Frequency: Yearly recurring

Examination

Type: graded semester performance

project 60%, homework 40%

Registration & Places

Max Places: 190

Course Components

Type	Title	Time & Place	Hours
lecture	Foundations of Reinforcement Learning	Fri 14:15-16:00 (CAB G 11)	2 h weekly
independent project	Foundations of Reinforcement Learning	No time listed	2 h weekly

Offered In

Computer Science Master
- Master Studies (Programme Regulations 2020)
  - Majors
    - Major in Machine Intelligence
      - Elective Courses
  - Minors
    - Minor in Machine Learning
Mathematics Master
- Application Area (Only necessary and eligible for the Master degree in Applied Mathematics. One of the application areas specified must be selected for the category Application Area for the Master degree in Applied Mathematics. At least 8 credits are required in the chosen application area.)
  - Machine Learning (The list is not yet complete.)
Electrical Engineering and Information Technology Master
- Master Studies (Programme Regulations 2018)
  - Signal Processing and Machine Learning (The core courses and specialisation courses below are a selection for students who wish to specialise in the area of "Signal Processing and Machine Learning", see . The individual study plan is subject to the tutor's approval.)
    - Specialisation Courses (These specialisation courses are particularly recommended for the area of "Signal Processing and Machine Learning", but you are free to choose courses from any other field in agreement with your tutor. A minimum of 40 credits must be obtained from specialisation courses during the MSc EEIT.)
- Master Studies (Programme Regulations 2008)
  - Major Courses (A total of 42 CP must be achieved during the Master Programme. The individual study plan is subject to the tutor's approval.)
    - Signal Processing and Machine Learning
      - Recommended Subjects
CAS in Computer Science
- Focus Courses and Electives
Data Science Master
- Core Courses
  - Core Electives
Cyber Security Master
- Minor
  - Machine Intelligence
    - Elective Courses