Course detail
Speech Signal Processing
FIT-ZREAcad. year: 2020/2021
Applications of speech processing, digital processing of speech signals, production and perception of speech, introduction to phonetics, pre-processing and basic parameters of speech, linear-predictive model, cepstrum, fundamental frequency estimation, coding - time domain and vocoders, recognition - DTW and HMM, synthesis. Software and libraries for speech processing.
Guarantor
Learning outcomes of the course unit
Prerequisites
Co-requisites
Recommended optional programme components
Literature
Rabiner, L. R., & Schafer, R. W. Theory and applications of digital speech processing, Pearson, 2011.
Psutka, J., Müller, L., Matoušek, J., & Radová, V., Mluvíme s počítačem česky, Academia, 2006.
Yu, D., Deng, L., Automatic speech recognition, Springer, 2016.
Planned learning activities and teaching methods
Assesment methods and criteria linked to learning outcomes
- mid-term test 14 pts
- project 29 pts
- presentation of results in computer labs 6 pts
Language of instruction
Work placements
Aims
Classification of course in study plans
- Programme IT-MGR-2 Master's
branch MBI , any year of study, summer semester, 5 credits, compulsory-optional
branch MPV , any year of study, summer semester, 5 credits, compulsory-optional
branch MIS , any year of study, summer semester, 5 credits, elective
branch MBS , any year of study, summer semester, 5 credits, elective
branch MIN , any year of study, summer semester, 5 credits, compulsory-optional
branch MMM , any year of study, summer semester, 5 credits, elective - Programme MITAI Master's
specialization NADE , any year of study, summer semester, 5 credits, elective
specialization NBIO , any year of study, summer semester, 5 credits, elective
specialization NGRI , any year of study, summer semester, 5 credits, elective
specialization NNET , any year of study, summer semester, 5 credits, elective
specialization NVIZ , any year of study, summer semester, 5 credits, elective
specialization NCPS , any year of study, summer semester, 5 credits, elective
specialization NSEC , any year of study, summer semester, 5 credits, elective
specialization NEMB , any year of study, summer semester, 5 credits, elective
specialization NHPC , any year of study, summer semester, 5 credits, elective
specialization NISD , any year of study, summer semester, 5 credits, elective
specialization NIDE , any year of study, summer semester, 5 credits, elective
specialization NISY , any year of study, summer semester, 5 credits, elective
specialization NMAL , any year of study, summer semester, 5 credits, elective
specialization NMAT , any year of study, summer semester, 5 credits, elective
specialization NSEN , any year of study, summer semester, 5 credits, elective
specialization NVER , any year of study, summer semester, 5 credits, elective
specialization NSPE , any year of study, summer semester, 5 credits, compulsory - Programme IT-MGR-2 Master's
branch MGM , 1. year of study, summer semester, 5 credits, compulsory
branch MSK , 2. year of study, summer semester, 5 credits, compulsory-optional
Type of course unit
Lecture
Teacher / Lecturer
Syllabus
- Introduction, applications of speech processing.
- Digital processing of speech signals.
- Speech production and its signal processing model.
- Pre-processing and basic parameters of speech, cepstrum.
- Linear-predictive model.
- Fundamental frequency estimation.
- Speech coding - basics
- CELP Speech coding.
- Speech recognition - basics, DTW.
- Hidden Markov models HMM.
- Large vocabulary continuous speech recognition (LVCSR) systems.
- Speaker and language recognition. Neural networks in speech processing.
- Text to speech synthesis.
Fundamentals seminar
Teacher / Lecturer
Syllabus
- Parameterization, DTW, HMM.
Exercise in computer lab
Teacher / Lecturer
Syllabus
- Except the last one, Matlab is used in labs.
- Introduction.
- Linear prediction and vector quantization.
- Fundamental frequency estimation and speech coding.
- Basics of classification.
- Recognition - Dynamic time Warping (DTW).
- Recognition - hidden Markov models (HTK).