Course detail
Classification and recognition
FIT-KRDAcad. year: 2021/2022
Estimation of parameters Maximum Likelihood and Expectation-Maximization, formulation of the objective function of discriminative training, Maximum Mutual information (MMI) criterion, adaptation of GMM models, transforms of features for recognition, modelling of feature space using discriminative sub-spaces, factor analysis, kernel techniques, calibration and fusion of classifiers, applications in recognition of speech, video and text.
State doctoral exam - topics:
- Maximum Likelihood estimation of parameters of a model
- Probability distribution from the exponential family and sufficient statistics
- Linear regression model and its probabilistic interpretation
- Bayesian models considering the probability distribution (uncertainty) of model parameters
- Conjugate priors and their significance in Bayesian models
- Fishers linear discriminant analysis
- Difference between generative and discriminative classifiers; their pros and cons
- Perceptron and its learning algorithm as an example of linear classifiers
- Generative linear classifier - Gaussian classifier with shared covariance matrix
- Discriminative classifier based on linear logistic regression
Guarantor
Learning outcomes of the course unit
The students will get acquainted with advanced classification and recognition techniques and learn how to apply basic methods in the fields of speech recognition, computer graphics and natural language processing.
The students will learn to solve general problems of classification and recognition.
Prerequisites
Basic knowledge of statistics, probability theory, mathematical analysis and algebra.
Co-requisites
Not applicable.
Recommended optional programme components
Not applicable.
Recommended or required reading
Christopher M. Bishop: Pattern Recognition and Machine Learning, Springer, 2006.
Ian Goodfellow, Yoshua Bengio, Aaron Courville: Deep Learning, MIT Press, 2016.
Simon Haykin: Neural Networks And Learning Machines, Pearson Education; Third edition, 2016.
Planned learning activities and teaching methods
Not applicable.
Assesment methods and criteria linked to learning outcomes
Not applicable.
Language of instruction
Czech, English
Work placements
Not applicable.
Aims
To understand advanced classification and recognition techniques and to learn how to apply the algorithms and methods to problems in speech recognition, computer graphics and natural language processing. To get acquainted with discriminative training and building hybrid systems.
Specification of controlled education, way of implementation and compensation for absences
Oral exam.
Classification of course in study plans
- Programme VTI-DR-4 Doctoral
branch DVI4 , any year of study, summer semester, 0 credits, elective
- Programme VTI-DR-4 Doctoral
branch DVI4 , any year of study, summer semester, 0 credits, elective
- Programme VTI-DR-4 Doctoral
branch DVI4 , any year of study, summer semester, 0 credits, elective
- Programme VTI-DR-4 Doctoral
branch DVI4 , any year of study, summer semester, 0 credits, elective
- Programme DIT Doctoral, any year of study, summer semester, 0 credits, compulsory-optional
- Programme DIT-EN Doctoral, any year of study, summer semester, 0 credits, compulsory-optional
- Programme DIT-EN Doctoral, any year of study, summer semester, 0 credits, compulsory-optional
- Programme DIT Doctoral, any year of study, summer semester, 0 credits, compulsory-optional
Type of course unit
Lecture
39 hours, optionally
Teacher / Lecturer
Syllabus
- Estimation of parameters of Gaussian probability distribution by Maximum Likelihood (ML)
- Estimation of parameters of Gaussian Gaussian Mixture Model (GMM) by Expectation-Maximization (EM)
- Discriminative training, introduction, formulation of the objective function
- Discriminative training with the Maximum Mutual information (MMI) criterion
- Adaptation of GMM models- Maximum A-Posteriori (MAP), Maximum Likelihood Linear Regression (MLLR)
- Transforms of features for recognition - basis, Principal component analysis (PCA)
- Discriminative transforms of features - Linear Discriminant Analysis (LDA) and Heteroscedastic Linear Discriminant Analysis (HLDA)
- Modeling of feature space using discriminative sub-spaces - factor analysis
- Kernel techniques, SVM
- Calibration and fusion of classifiers
- Applications in recognition of speech, video and text
- Student presentations I
- Student presentations II
Guided consultation in combined form of studies
26 hours, optionally
Teacher / Lecturer