P124 Speech signal processing

Faculty of Informatics
Autumn 1999
Extent and Intensity
2/1. 2 credit(s) (plus extra credits for completion). Recommended Type of Completion: zk (examination). Other types of completion: k (colloquium), z (credit).
Teacher(s)
prof. Dr. Ing. Jan Černocký (lecturer), doc. RNDr. Ivan Kopeček, CSc. (deputy)
Guaranteed by
prof. PhDr. Karel Pala, CSc.
Department of Machine Learning and Data Processing – Faculty of Informatics
Contact Person: doc. RNDr. Ivan Kopeček, CSc.
Course Enrolment Limitations
The course is also offered to students of fields other than those with which it is directly associated.
Syllabus
  • Information content of the written and spoken forms of speech.
  • Techniques of signal processing applied to speech: Fourier transform, z-transform, linear filtering.
  • Time-domain and frequency-domain behavior of linear systems.
  • Signal processing model of speech production.
  • Excitation and filter.
  • Determination of parameters using linear prediction.
  • LPC coefficients and derived parameters (PARCOR, LAR, ...).
  • Speech analysis using short-time Fourier transform (STFT): filter-bank interpretation, computation using fast Fourier transform (FFT).
  • Cepstral analysis.
  • Parameterization with perceptually warped frequency axis.
  • Fundamental frequency determination.
  • Features for speech processing, criteria of choice.
  • Measures of similarity between speech segments.
  • Speech coding: waveform and parametric vocoders.
  • Excitation modeling (CELP).
  • Phonetic vocoders.
  • Pattern recognition techniques in speech recognition.
  • Hidden Markov Models (HMM).
  • HMM training and HMM decoding.
  • Extension of HMMs to continuous speech recognition.
  • Statistical language models.
  • The methods studied are practised experimentally in computer laboratory sessions using Matlab/Octave.
Literature
  • PSUTKA, Josef. Komunikace s počítačem mluvenou řečí. Praha: Academia, 1995, 287 pp. ISBN 8020002030.
  • RABINER, Lawrence R. and Biing-Hwang JUANG. Fundamentals of speech recognition. Englewood Cliffs: Prentice Hall PTR, 1993, xxxv, 507 pp. ISBN 0-13-015157-2.
Assessment methods
Written examination.
Language of instruction
Czech
Further Comments
The course is taught annually.
The course is taught every week.
The course is also listed under the following terms: Autumn 2000, Autumn 2001.
  • Permalink: https://is.muni.cz/course/fi/autumn1999/P124