VVZ API is not affiliated with ETH Zurich. Data might be outdated or incorrect. Please view the official ETHZ Vorlesungsverzeichnis for binding information.

227-0678-00L 4 Credits

You're viewing possible stale or outdated data. Please check the latest semester for more up-to-date information.

Speech Processing II

Sprachverarbeitung II

Lecturers & Examiners: Beat Pfister, Dr. Christof Traber, Dr. René Beutler

VVZ CR n/a

Last Updated: 2026-02-05 15:02:43

Abstract

Advanced course in text-to-speech synthesis and speech recognition(continuation of course speech processing I).

Objective

Detailed knowledge about selected concepts and approaches to the problems of speech synthesis (mainly transcription part) and automatic speech recognition.

Content

Fundamentals of representation and application of linguistic knowledge: Introduction of the theory of formal languages, the Chomsky hierarchy, word analysis, finite state machines, parsing. Speech synthesis: Natural language analysis (for words and sentences), lexicon, grammar for natural language; generation of the abstract representation of pronunciation (phone sequence, accents, phrases). Additionally, the ETH text-to-speech system SVOX is discussed. Speech recognition: The statistical approach to speech recognition with hidden Markov models is detailed: Basic algorithms (forward, Viterbi and Baum-Welch algorithm), problems of implementation, HMM training, whole vs. subword modeling, isolated word recognition, continuous speech recognition, statistical and rule-based language models.

Resources

Lecture Notes

Yes (available at the assistence in ETZ D97.5)

General Information

Language: German
Frequency: Yearly recurring

Examination

Type: session examination
Mode: oral 30 minutes

Course Components

Type	Title	Time & Place	Hours
lecture with exercise	Sprachverarbeitung II	Fri 13:15-17:00 (ETZ E 8)	4 h weekly