Course detail
Speech Signal Processing
FIT-ZREAcad. year: 2018/2019
Applications of speech processing, digital processing of speech signals, production and perception of speech, introduction to phonetics, pre-processing and basic parameters of speech, linear-predictive model, cepstrum, fundamental frequency estimation, coding - time domain and vocoders, recognition - DTW and HMM, synthesis. Software and libraries for speech processing.
Language of instruction
Number of ECTS credits
Mode of study
Guarantor
Learning outcomes of the course unit
Prerequisites
Co-requisites
Planned learning activities and teaching methods
Assesment methods and criteria linked to learning outcomes
- mid-term test 14 pts
- project 29 pts
- presentation of results in computer labs 6 pts
Course curriculum
Work placements
Aims
Specification of controlled education, way of implementation and compensation for absences
Recommended optional programme components
Prerequisites and corequisites
Basic literature
Recommended reading
Psutka, J.: Komunikace s počítačem mluvenou řečí. Academia, Praha, 1995, ISBN 80-200-0203-0
Rabiner, L., Juang, B.H.: Fundamentals of Speech Recognition, Signal Processing, Prentice Hall, Engelwood Cliffs, NJ, 1993, ISBN 0-13-015157-2
Classification of course in study plans
- Programme IT-MSC-2 Master's
branch MBI , 0 year of study, summer semester, compulsory-optional
branch MSK , 2 year of study, summer semester, compulsory-optional
branch MMM , 0 year of study, summer semester, elective
branch MBS , 0 year of study, summer semester, elective
branch MPV , 0 year of study, summer semester, compulsory-optional
branch MIS , 0 year of study, summer semester, elective
branch MIN , 0 year of study, summer semester, compulsory-optional
branch MGM , 1 year of study, summer semester, compulsory
Type of course unit
Lecture
Teacher / Lecturer
Syllabus
- Introduction, applications of speech processing.
- Digital processing of speech signals.
- Speech production and its signal processing model.
- Pre-processing and basic parameters of speech, cepstrum.
- Linear-predictive model.
- Fundamental frequency estimation.
- Speech coding - basics
- CELP Speech coding.
- Speech recognition - basics, DTW.
- Hidden Markov models HMM.
- Large vocabulary continuous speech recognition (LVCSR) systems.
- Speaker and language recognition. Neural networks in speech processing.
- Text to speech synthesis.
Fundamentals seminar
Teacher / Lecturer
Syllabus
- Parameterization, DTW, HMM.
Exercise in computer lab
Teacher / Lecturer
Syllabus
- Except the last one, Matlab is used in labs.
- Introduction.
- Linear prediction and vector quantization.
- Fundamental frequency estimation and speech coding.
- Basics of classification.
- Recognition - Dynamic time Warping (DTW).
- Recognition - hidden Markov models (HTK).