VVZ API is not affiliated with ETH Zurich. Data might be outdated or incorrect. Please view the official ETHZ Vorlesungsverzeichnis for binding information.
Large Language Models
Last Updated: 2026-06-03 00:14:10
Abstract
Large language models have become one of the most commonly deployed NLP inventions. In the past half-decade, their integration into core natural language processing tools has dramatically increased the performance of such tools, and they have entered the public discourse surrounding artificial intelligence.
Objective
To understand the mathematical foundations of large language models as well as how to implement them.
Content
We start with the probabilistic foundations of language models, i.e., covering what constitutes a language model from a formal, theoretical perspective. We then discuss how to construct and curate training corpora, and introduce many of the neural-network architectures often used to instantiate language models at scale. The course covers aspects of systems programming, discussion of privacy and harms, as well as applications of language models in NLP and beyond.
Resources
Literature
The lecture notes will be supplemented with various readings from the literature.
General Information
- Language
- English
- Levels
- MSC , WBZ
- Frequency
- Yearly recurring
Examination
- Type
- session examination
- Mode
- written 180 minutes
- Aids
- Two A4-pages (i.e. one A4-sheet of paper), either handwritten or 11. A simple non-programmable calculator.
Course Components
| Type | Title | Time & Place | Hours |
|---|---|---|---|
| lecture |
Large Language Models
Tuesday: course is held in HG E7 and will be transferred via videoconference to HG E3.
|
|
3 h weekly |
| exercise | Large Language Models |
|
2 h weekly |
| independent project | Large Language Models | No time listed | 2 h weekly |
Offered In
-
-
Electives (In the ‘electives’ subcategory, at least two course units must be successfully completed. All courses listed as core courses (not electives) for one of the following ETH MSc programmes, MSc Statistics, MSc Physics, MSc Computer Science, MSc (Applied) Mathematics, MSc Neural Systems and Computation, MSc Robotics, Systems, and Control, MSc Data Science, MSc Electrical Engineering and Information Technology, can be taken as an elective course in the MSc CSE without prior permission.)
-
-
-
-
-
Application Area (Only necessary and eligible for the Master degree in Applied Mathematics. One of the application areas specified must be selected for the category Application Area for the Master degree in Applied Mathematics. At least 8 credits are required in the chosen application area. Credits from other application areas cannot be recognised for further application areas.)
-
-
Statistics Master (The following courses belong to the curriculum of the Master's Programme in Statistics. The corresponding credits do not count as external credits even for course units where an enrolment at ETH Zurich is not possible.)
-
-
-
-
-
-
-