VVZ API is not affiliated with ETH Zurich. Data might be outdated or incorrect. Please view the official ETHZ Vorlesungsverzeichnis for binding information.

227-0677-00L 6 Credits DS , MSC , WBZ D-ITET , D-INFK

Speech Processing I

Sprachverarbeitung I

Lecturers & Examiners: Beat Pfister
VVZ CR n/a

Last Updated: 2026-02-05 15:24:42

Abstract

Fundamentals of speech signal processing and introduction to text-to-speech synthesis and speech recognition.

Objective

Knowledge of the fundamentals of speech processing. Acquisition of practical experience in this field. Introduction to text-to-speech synthesis and speech recognition.

Content

Basic considerations of speech and language: Human communication, description of speech and language, human speech production and perception. Synopsis of speech and language processing topics. Analysis, represenation and properties of speech signals: Time and frequency domain representations, quasi-stationarity, formants, pitch, short-time analysis, spectrum, autocorrelation, linear prediction, homomorphic analysis. Fundamental problems of speech synthesis: Relations between text and speech; methods of speech production; prosody control. Fundamental problems of speech recognition: Variability of speech signals, speech features for speech recognition, pattern matching (distance measures, dynamic programming), and introduction to speech recognition with hidden Markov models.

Resources

Lecture Notes

The following textbook will be used: "Sprachverarbeitung - Grundlagen und Methoden der Sprachsynthese und Spracherkennung", B. Pfister und T. Kaufmann, Springer Verlag, ISBN: 978-3-540-75909-6

General Information

Language
German
Levels
DS , MSC , WBZ
Frequency
Yearly recurring

Examination

Type
session examination
Mode
oral 30 minutes

Course Components

Type Title Time & Place Hours
lecture with exercise Sprachverarbeitung I
  • Fri 13:15-17:00 (ETF C 1)
4 h weekly

Offered In