VVZ API is not affiliated with ETH Zurich. Data might be outdated or incorrect. Please view the official ETHZ Vorlesungsverzeichnis for binding information.

227-0678-00L 4 Credits

Speech Processing II

Sprachverarbeitung II

Lecturers & Examiners: Prof. Dr. Hans-Peter Hutter, Beat Pfister, Dr. Christof Traber

Last Updated: 2026-02-05 14:57:26

Objective

Detailed knowledge of selected concepts and approaches in speech synthesis (mainly the transcription part) and automatic speech recognition.

Content

Fundamentals of the representation and application of linguistic knowledge: introduction to the theory of formal languages, the Chomsky hierarchy, word analysis, finite state machines, parsing.

Speech synthesis: natural language analysis (for words and sentences), lexicon, grammar for natural language; generation of the abstract representation of pronunciation (phone sequence, accents, phrases). Additionally, the ETH text-to-speech system SVOX is discussed.

Speech recognition: the statistical approach to speech recognition with hidden Markov models is covered in detail: basic algorithms (forward, Viterbi and Baum-Welch algorithms), implementation problems, HMM training, whole-word vs. subword modeling, isolated word recognition, continuous speech recognition, statistical and rule-based language models.
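For orientation, two of the basic HMM algorithms named above (forward and Viterbi) can be sketched on a toy model. This is a minimal illustration and not material from the course: the state names, the observation symbols, all probabilities, and the function names `forward` and `viterbi` are assumptions made up for this example.

```python
# Toy HMM. States, symbols, and probabilities are invented for illustration only.
STATES = ("s1", "s2")
START = {"s1": 0.6, "s2": 0.4}
TRANS = {"s1": {"s1": 0.7, "s2": 0.3}, "s2": {"s1": 0.4, "s2": 0.6}}
EMIT = {"s1": {"a": 0.5, "b": 0.5}, "s2": {"a": 0.1, "b": 0.9}}


def forward(obs):
    """Forward algorithm: probability of obs, summed over all state sequences."""
    # alpha[s] = probability of the observations so far, ending in state s
    alpha = {s: START[s] * EMIT[s][obs[0]] for s in STATES}
    for o in obs[1:]:
        alpha = {
            s: EMIT[s][o] * sum(alpha[r] * TRANS[r][s] for r in STATES)
            for s in STATES
        }
    return sum(alpha.values())


def viterbi(obs):
    """Viterbi algorithm: most probable state sequence and its probability."""
    # delta[s] = probability of the best single path so far, ending in state s
    delta = {s: START[s] * EMIT[s][obs[0]] for s in STATES}
    backpointers = []
    for o in obs[1:]:
        prev, delta, step = delta, {}, {}
        for s in STATES:
            best = max(STATES, key=lambda r: prev[r] * TRANS[r][s])
            step[s] = best
            delta[s] = prev[best] * TRANS[best][s] * EMIT[s][o]
        backpointers.append(step)
    # Trace back from the best final state
    last = max(STATES, key=lambda s: delta[s])
    path = [last]
    for step in reversed(backpointers):
        path.append(step[path[-1]])
    path.reverse()
    return path, delta[last]
```

The two functions differ only in replacing the sum over predecessor states with a maximum, plus the backpointers needed to recover the path. Calling `viterbi(["a", "b"])` returns the best state sequence together with its probability, which can never exceed `forward(["a", "b"])`. A practical implementation would work with log probabilities to avoid numerical underflow on long observation sequences.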

Resources

Lecture Notes

Yes (available from the assistants in ETZ D97.5)

General Information

Language: German
Frequency: Yearly recurring

Examination

Type: session examination
Mode: oral, 30 minutes

Course Components

Type	Title	Time & Place	Hours
lecture with exercise	Sprachverarbeitung II	Fri 13:15-17:00 (ETZ E 8)	4 h weekly