Project detail

Funding sources

Czech Science Foundation (Grantová agentura České republiky) - Standard projects

About the project

Project on spoken speech recognition in real conditions (Rozpoznávání mluvené řeči v reálných podmínkách). The project builds on previous grant-funded research in which the team developed, and partly implemented, basic methods of speech recognition for Czech. For these methods to be deployed successfully in the most requested applications, such as transcription of calls, recorded discussions, or courtroom proceedings, attention must turn to analysing and modelling ordinary spoken (colloquial) speech recorded in real conditions, in the presence of noise, background sounds, and possibly other speakers.

Description in English
This project follows preceding research projects in which the participating teams developed and implemented basic speech recognition algorithms for Czech. For these algorithms to be used successfully in the most challenging applications, such as transcription of talks, recordings of court hearings, etc., the research must continue with the analysis and modelling of colloquial speech recorded in real conditions (e.g. with different backgrounds, noises, or cross-talk). The main goal of this four-year project is to design and test new techniques for speech feature extraction, background and noise suppression, speaker change-point detection, and quick adaptation to new speaker characteristics; to improve the lexical and phonetic inventory of recognition systems for colloquial speech; and to develop language models with better coverage of the inflective nature of Czech. The project will contribute to advancing the state of the art in basic speech recognition research and will facilitate the integration of the involved teams into the European research community.

Keywords
speech recognition (Czech: rozpoznávání řeči)

Project code

GA102/08/0707

Original language

Czech

Investigators

Pollák Petr - principal investigator
Burget Lukáš, doc. Ing., Ph.D. - co-investigator
Černocký Jan, prof. Dr. Ing. - co-investigator
Matějka Pavel, Ing., Ph.D. - co-investigator
Schwarz Petr, Ing., Ph.D. - co-investigator

Units

Faculty of Information Technology
- responsible unit (13.5.2011 - not specified)
Department of Computer Graphics and Multimedia
- responsible unit (10.4.2008 - not specified)
Speech Data Mining Research Group BUT Speech@FIT
- internal (1.1.2008 - 31.12.2011)
Department of Computer Graphics and Multimedia
- co-beneficiary (1.1.2008 - 31.12.2011)
Faculty of Information Technology
- beneficiary (13.5.2011 - not specified)

Results

VESELÝ, K.: Neural Network Trainer TNet. URL: http://speech.fit.vutbr.cz/en/software/neural-network-trainer-tnet. (Software)

PLCHOT, O.; HUBEIKA, V.; BURGET, L.; SCHWARZ, P.; MATĚJKA, P. Acquisition of Telephone Data from Radio Broadcasts with Applications to Language Recognition. Proc. 11th International Conference on Text, Speech and Dialogue. Berlin: Springer Verlag, 2008. p. 477. ISBN: 978-3-540-87390-7.

GLEMBEK, O.; BURGET, L.; DEHAK, N.; BRÜMMER, N.; KENNY, P. Comparison of Scoring Methods used in Speaker Recognition with Joint Factor Analysis. Proc. ICASSP 2009. Taipei: IEEE Signal Processing Society, 2009. p. 1. ISBN: 978-1-4244-2354-5.

MIKOLOV, T.; KOMBRINK, S.; BURGET, L.; ČERNOCKÝ, J.; KHUDANPUR, S. Extensions of Recurrent Neural Network Language Model. Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011. p. 5528. ISBN: 978-1-4577-0537-3.

KOCKMANN, M.; BURGET, L.; ČERNOCKÝ, J. Brno University of Technology System for Interspeech 2009 Emotion Challenge. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. iss. 9, p. 348. ISSN: 1990-9772.

KOMBRINK, S.; HANNEMANN, M.; BURGET, L. Out-of-Vocabulary Word Detection and Beyond. In Detection and Identification of Rare Audiovisual Cues. Studies in Computational Intelligence, 384. Springer-Verlag Berlin Heidelberg: Springer Verlag, 2012. p. 57. ISBN: 978-3-642-24033-1.

BURGET, L.; PLCHOT, O.; CUMANI, S.; GLEMBEK, O.; MATĚJKA, P.; BRÜMMER, N. Discriminatively Trained Probabilistic Linear Discriminant Analysis for Speaker Verification. In Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011. p. 4832-4835. ISBN: 978-1-4577-0537-3.

KARAFIÁT, M.; SZŐKE, I.; ČERNOCKÝ, J. Using Gradient Descent Optimization for Acoustic Training from Heterogeneous Data. Proc. Text, Speech and Dialogue 2010. Lecture Notes in Computer Science. LNAI 6231. Brno: Springer Verlag, 2010. iss. 9, p. 322. ISBN: 978-3-642-15759-2. ISSN: 0302-9743.

GRÉZL, F.; FOUSEK, P. Optimizing bottle-neck features for LVCSR. 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing. Las Vegas, Nevada: IEEE Signal Processing Society, 2008. p. 4729. ISBN: 1-4244-1484-9.

CUMANI, S.; PLCHOT, O.; KARAFIÁT, M. Independent Component Analysis and MLLR Transforms for Speaker Identification. Proc. International Conference on Acoustics, Speech, and Signal P. Kyoto: IEEE Signal Processing Society, 2012. p. 4365. ISBN: 978-1-4673-0044-5.

GLEMBEK, O.; MATĚJKA, P.; BURGET, L.; MIKOLOV, T. Advances in Phonotactic Language Recognition. Proc. Interspeech 2008. Proceedings of Interspeech. Brisbane: International Speech Communication Association, 2008. iss. 9, p. 1. ISSN: 1990-9772.

BURGET, L.; FAPŠO, M.; HUBEIKA, V.; GLEMBEK, O.; KARAFIÁT, M.; KOCKMANN, M.; MATĚJKA, P.; SCHWARZ, P.; ČERNOCKÝ, J. BUT system description: NIST SRE 2008. Proc. 2008 NIST Speaker Recognition Evaluation Workshop. Montreal: National Institute of Standards and Technology, 2008. p. 1.

BRÜMMER, N.; STRASHEIM, A.; HUBEIKA, V.; MATĚJKA, P.; BURGET, L.; GLEMBEK, O. Discriminative Acoustic Language Recognition via Channel-Compensated GMM Statistics. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. iss. 9, p. 2187. ISBN: 978-1-61567-692-7. ISSN: 1990-9772.

SZŐKE, I.; GRÉZL, F.; ČERNOCKÝ, J.; FAPŠO, M. Acoustic keyword spotter - optimization from end-user perspective. Proceedings of the 2010 IEEE Spoken Language Technology Workshop. IEEE Catalog Number: CFP 10SLT-USB. Berkeley, California: IEEE Signal Processing Society, 2010. p. 177. ISBN: 978-1-4244-7902-3.

DEORAS, A.; MIKOLOV, T.; KOMBRINK, S.; CHURCH, K. Approximate inference: A sampling based modeling technique to capture complex dependencies in a language model. Speech Communication, 2012, vol. 2012, iss. 8, p. 1-16. ISSN: 0167-6393.

BRÜMMER, N.; BURGET, L.; GLEMBEK, O.; HUBEIKA, V.; JANČÍK, Z.; KARAFIÁT, M.; MATĚJKA, P.; MIKOLOV, T.; PLCHOT, O.; STRASHEIM, A. BUT-AGNITIO System Description for NIST Language Recognition Evaluation 2009. Proceedings NIST 2009 Language Recognition Evaluation Workshop. Baltimore, Maryland, USA: National Institute of Standards and Technology, 2009. p. 1.

HAIN, T.; BURGET, L.; DINES, J.; GARNER, P.; GRÉZL, F.; EL HANNANI, A.; HUIJBREGTS, M.; KARAFIÁT, M.; LINCOLN, M.; WAN, V. Transcribing Meetings with the AMIDA System. IEEE Transactions on Audio, Speech, and Language Processing, 2012, vol. 20, iss. 2, p. 486-498. ISSN: 1558-7916.

MIKOLOV, T.; DEORAS, A.; POVEY, D.; BURGET, L.; ČERNOCKÝ, J. Strategies for Training Large Scale Neural Network Language Models. Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011. p. 196. ISBN: 978-1-4673-0366-8.

MIKOLOV, T. Language models for automatic speech recognition of Czech lectures. Proc. STUDENT EEICT 2008. Brno: Faculty of Electrical Engineering and Communication BUT, 2008. p. 1. ISBN: 978-80-214-3617-6.

KOCKMANN, M.; BURGET, L. Syllable based Feature-Contours for Speaker Recognition. Proc. 14th International Workshop on Advances in Speech Technology. Maribor: 2008. p. 1.

MIKOLOV, T.; PLCHOT, O.; GLEMBEK, O.; MATĚJKA, P.; BURGET, L.; ČERNOCKÝ, J. PCA-based Feature Extraction for Phonotactic Language Recognition. In Proc. Odyssey 2010 - The Speaker and Language Recognition Workshop. Brno: International Speech Communication Association, 2010. p. 251-255. ISBN: 978-80-214-4114-9.

KOCKMANN, M.; BURGET, L. Contour modeling of prosodic and acoustic features for speaker recognition. Proc. 2008 IEEE Workshop on Spoken Language Technology. Goa: IEEE Signal Processing Society, 2008. p. 1. ISBN: 978-1-4244-3472-5.

BURGET, L.; SCHWARZ, P.; MATĚJKA, P.; HANNEMANN, M.; RASTROW, A.; WHITE, C.; KHUDANPUR, S.; HEŘMANSKÝ, H.; ČERNOCKÝ, J. Combination of strongly and weakly constrained recognizers for reliable detection of OOVs. Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Las Vegas: IEEE Signal Processing Society, 2008. p. 1. ISBN: 1-4244-1484-9.

POVEY, D.; HANNEMANN, M.; BOULIANNE, G.; BURGET, L.; GHOSHAL, A.; JANDA, M.; KARAFIÁT, M.; KOMBRINK, S.; MOTLÍČEK, P.; QIAN, Y.; RIEDHAMMER, K.; VESELÝ, K.; VU, N. Generating Exact Lattices in The WFST Framework. Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing. Kyoto: IEEE Signal Processing Society, 2012. p. 4213. ISBN: 978-1-4673-0044-5.

GRÉZL, F.; KARAFIÁT, M.; BURGET, L. Investigation into bottle-neck features for meeting speech recognition. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. iss. 9, p. 2947. ISBN: 978-1-61567-692-7. ISSN: 1990-9772.

DEORAS, A.; MIKOLOV, T.; CHURCH, K. A Fast Re-scoring Strategy to Capture Long-Distance Dependencies. Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing July 2011 Edinburgh, Scotland, UK. Edinburgh: Association for Computational Linguistics, 2011. p. 1116. ISBN: 978-1-937284-11-4.

SANTHOSH KUMAR, C.; LI, H.; TONG, R.; MATĚJKA, P.; BURGET, L.; ČERNOCKÝ, J. Tuning phone decoders for language identification. Proc. International Conference on Acoustics, Speech, and Signal Processing 2010. Proceedings of the ... IEEE International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010. iss. 3, p. 5010. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149.

KOCKMANN, M.; FERRER, L.; BURGET, L.; SHRIBERG, E.; ČERNOCKÝ, J. Recent Progress in Prosodic Speaker Verification. Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011. p. 4556. ISBN: 978-1-4577-0537-3.

KOMBRINK, S.; HANNEMANN, M.; BURGET, L. Out-of-vocabulary word detection and beyond. ECML PKDD 2010 Proceedings and Journal Content. Barcelona: 2010. p. 1.

KOMBRINK, S.; BURGET, L.; MATĚJKA, P.; KARAFIÁT, M.; HEŘMANSKÝ, H. Posterior-based Out of Vocabulary Word Detection in Telephone Speech. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. iss. 9, p. 80. ISSN: 1990-9772.

GRÉZL, F.; KARAFIÁT, M. Integrating recent MLP feature extraction techniques into TRAP architecture. Proceedings of Interspeech 2011. Proceedings of Interspeech. Florence: International Speech Communication Association, 2011. iss. 8, p. 1229. ISBN: 978-1-61839-270-1. ISSN: 1990-9772.

GRÉZL, F.; ČERNOCKÝ, J. Audio Surveillance through Known Event Classification. Radioengineering, 2009, vol. 18, iss. 4, p. 671-675. ISSN: 1210-2512.

GLEMBEK, O.; BURGET, L.; KENNY, P.; KARAFIÁT, M.; MATĚJKA, P. Simplification and optimization of I-Vector Extraction. Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011. p. 4516. ISBN: 978-1-4577-0537-3.

KOCKMANN, M.; BURGET, L.; ČERNOCKÝ, J. Investigations into prosodic syllable contour features for speaker recognition. Proc. International Conference on Acoustics, Speech, and Signal Processing. Proceedings of the ... IEEE International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010. iss. 3, p. 4418. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149.

HAIN, T.; BURGET, L.; DINES, J.; GARNER, P.; EL HANNANI, A.; HUIJBREGTS, M.; KARAFIÁT, M.; LINCOLN, M.; WAN, V. The AMIDA 2009 Meeting Transcription System. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010. iss. 9, p. 358. ISBN: 978-1-61782-123-3. ISSN: 1990-9772.

BURGET, L.; FAPŠO, M.; HUBEIKA, V.; GLEMBEK, O.; KARAFIÁT, M.; KOCKMANN, M.; MATĚJKA, P.; SCHWARZ, P.; ČERNOCKÝ, J. Brno University Of Technology - NIST 2008 SRE. Montreal: 2008. p. 1.

KOCKMANN, M.; BURGET, L.; ČERNOCKÝ, J. Application of speaker- and language identification state-of-the-art techniques for emotion recognition. Speech Communication, 2011, vol. 53, iss. 9, p. 1172-1185. ISSN: 0167-6393.

SZŐKE, I.; FAPŠO, M.; BURGET, L.; ČERNOCKÝ, J. Hybrid word-subword decoding for spoken term detection. Proc. SSCS 2008: Speech search workshop at SIGIR. Singapore: Association for Computing Machinery, 2008. p. 1. ISBN: 978-90-365-2697-5.

MIKOLOV, T.; KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J.; KHUDANPUR, S. Recurrent neural network based language model. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010. iss. 9, p. 1045. ISBN: 978-1-61782-123-3. ISSN: 1990-9772.

HUBEIKA, V.; BURGET, L.; MATĚJKA, P.; SCHWARZ, P. Discriminative Training and Channel Compensation for Acoustic Language Recognition. Proc. Interspeech 2008. Proceedings of Interspeech. Brisbane: International Speech Communication Association, 2008. iss. 9, p. 1. ISSN: 1990-9772.

VESELÝ, K.; BURGET, L.; GRÉZL, F. Parallel Training of Neural Networks for Speech Recognition. Proc. Text, Speech and Dialogue 2010. Lecture Notes in Computer Science. LNAI 6231. Brno: Springer Verlag, 2010. iss. 9, p. 439. ISBN: 978-3-642-15759-2. ISSN: 0302-9743.

GRÉZL, F. The Role of Neural Network Size in TRAP/HATS Feature Extraction. Proceedings Text, Speech and Dialogue 2011. Lecture Notes in Computer Science. LNAI 6836. Plzeň: Springer Verlag, 2011. iss. 9, p. 315. ISBN: 978-3-642-23537-5. ISSN: 0302-9743.

MATĚJKA, P.; BURGET, L.; GLEMBEK, O.; SCHWARZ, P.; HUBEIKA, V.; FAPŠO, M.; MIKOLOV, T.; PLCHOT, O.; ČERNOCKÝ, J. BUT language recognition system for NIST 2007 evaluations. Proc. Interspeech 2008. Proceedings of Interspeech. Brisbane, Australia: International Speech Communication Association, 2008. iss. 9, p. 1. ISSN: 1990-9772.

HANNEMANN, M.; KOMBRINK, S.; KARAFIÁT, M.; BURGET, L. Similarity Scoring for Recognizing Repeated Out-of-Vocabulary Words. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010. iss. 9, p. 897. ISBN: 978-1-61782-123-3. ISSN: 1990-9772.

DEORAS, A.; MIKOLOV, T.; KOMBRINK, S.; KARAFIÁT, M.; KHUDANPUR, S. Variational Approximation of Long-span Language Models for LVCSR. Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011. p. 5532. ISBN: 978-1-4577-0537-3.

KOCKMANN, M.; FERRER, L.; BURGET, L.; ČERNOCKÝ, J. iVector Fusion of Prosodic and Cepstral Features for Speaker Verification. Proceedings of Interspeech 2011. Proceedings of Interspeech. Florence: International Speech Communication Association, 2011. iss. 8, p. 265. ISBN: 978-1-61839-270-1. ISSN: 1990-9772.

VESELÝ, K.; BURGET, L.; GRÉZL, F. Parallel Training of Neural Networks for Speech Recognition. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010. iss. 9, p. 2934. ISSN: 1990-9772.

MATĚJKA, P.; GLEMBEK, O.; CASTALDO, F.; ALAM, J.; PLCHOT, O.; KENNY, P.; BURGET, L.; ČERNOCKÝ, J. Full-covariance UBM and Heavy-tailed PLDA in I-Vector Speaker Verification. In Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011. p. 4828-4831. ISBN: 978-1-4577-0537-3.

KARAFIÁT, M.; BURGET, L.; MATĚJKA, P.; GLEMBEK, O.; ČERNOCKÝ, J. iVector-Based Discriminative Adaptation for Automatic Speech Recognition. Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011. p. 152. ISBN: 978-1-4673-0366-8.

JANČÍK, Z.; PLCHOT, O.; BRUMMER, J.; BURGET, L.; GLEMBEK, O.; HUBEIKA, V.; KARAFIÁT, M.; MATĚJKA, P.; MIKOLOV, T.; STRASHEIM, A.; ČERNOCKÝ, J. Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system. In Proc. Odyssey 2010 - The Speaker and Language Recognition Workshop. Brno: International Speech Communication Association, 2010. p. 215-221. ISBN: 978-80-214-4114-9.

MIKOLOV, T.; KOMBRINK, S.; DEORAS, A.; BURGET, L.; ČERNOCKÝ, J. RNNLM - Recurrent Neural Network Language Modeling Toolkit. Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011. p. 1. ISBN: 978-1-4673-0366-8.

BURGET, L.; FAPŠO, M.; HUBEIKA, V.; GLEMBEK, O.; KARAFIÁT, M.; KOCKMANN, M.; MATĚJKA, P.; SCHWARZ, P.; ČERNOCKÝ, J. BUT system for NIST 2008 speaker recognition evaluation. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. iss. 9, p. 2335. ISBN: 978-1-61567-692-7. ISSN: 1990-9772.

MIKOLOV, T.; DEORAS, A.; KOMBRINK, S.; BURGET, L.; ČERNOCKÝ, J. Empirical Evaluation and Combination of Advanced Language Modeling Techniques. Proceedings of Interspeech 2011. Proceedings of Interspeech. Florence: International Speech Communication Association, 2011. iss. 8, p. 605. ISBN: 978-1-61839-270-1. ISSN: 1990-9772.

KARAFIÁT, M. Study of linear transformations applied to training of cross-domain adapted large vocabulary continuous speech recognition systems. Brno: 2009. 73 p.

POVEY, D.; BURGET, L.; AGARWAL, M.; AKYAZI, P.; GHOSHAL, A.; GLEMBEK, O.; GOEL, N.; KARAFIÁT, M.; RASTROW, A.; ROSE, R.; SCHWARZ, P.; THOMAS, S. The subspace Gaussian mixture model-A structured model for speech recognition. Computer Speech and Language, 2011, vol. 25, iss. 2, p. 404-439. ISSN: 0885-2308.

SZŐKE, I.; BURGET, L.; ČERNOCKÝ, J.; FAPŠO, M. Sub-word modeling of out of vocabulary words in spoken term detection. Proc. 2008 IEEE Workshop on Spoken Language Technology. Goa: IEEE Signal Processing Society, 2008. p. 1. ISBN: 978-1-4244-3472-5.

VESELÝ, K.; KARAFIÁT, M.; GRÉZL, F. Convolutive Bottleneck Network Features for LVCSR. Proceedings of ASRU 2011. Big Island, Hawaii: IEEE Signal Processing Society, 2011. p. 42. ISBN: 978-1-4673-0366-8.

PEŠÁN, J. Rozpoznávání mluvčího na mobilním telefonu [Speaker Recognition on a Mobile Phone]. Proceedings of the 17th Conference Student EEICT 2011. Volume 2. Brno: Vysoké učení technické v Brně, 2011. p. 341. ISBN: 978-80-214-4272-6.

KOMBRINK, S.; HANNEMANN, M.; BURGET, L.; HEŘMANSKÝ, H. Recovery of Rare Words in Lecture Speech. Proc. Text, Speech and Dialogue 2010. Lecture Notes in Computer Science. Brno: Springer Verlag, 2010. iss. 9, p. 330. ISBN: 978-3-642-15759-2. ISSN: 0302-9743.

GRÉZL, F.; KARAFIÁT, M. Hierarchical Neural Net Architectures for Feature Extraction in ASR. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010. iss. 9, p. 1201. ISBN: 978-1-61782-123-3. ISSN: 1990-9772.

SCHWARZ, P. Phoneme recognition based on long temporal context. Brno: Faculty of Information Technology BUT, 2009. p. 1.

GRÉZL, F.; KARAFIÁT, M.; JANDA, M. Study of Probabilistic and Bottle-Neck Features in Multilingual Environment. Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011. p. 359. ISBN: 978-1-4673-0366-8.

KOMBRINK, S.; MIKOLOV, T. Recurrent Neural Network Language Modeling Applied to the Brno AMI/AMIDA 2009 Meeting Recognizer Setup. Proceedings of the 17th Conference STUDENT EEICT 2011. Volume 3. Brno: Brno University of Technology, 2011. p. 527. ISBN: 978-80-214-4273-3.

KARAFIÁT, M.; BURGET, L.; HAIN, T.; ČERNOCKÝ, J. Discriminative training of narrow band - wide band adapted systems for meeting recognition. Proc. Interspeech 2008. Proceedings of Interspeech. Brisbane: International Speech Communication Association, 2008. iss. 9, p. 1. ISSN: 1990-9772.

BRUMMER, J.; BURGET, L.; KENNY, P.; MATĚJKA, P.; DE VILLIERS, E.; KARAFIÁT, M.; KOCKMANN, M.; GLEMBEK, O.; PLCHOT, O.; BAUM, D.; SENOUSSAOUI, M. ABC System description for NIST SRE 2010. Proc. NIST 2010 Speaker Recognition Evaluation. Brno: National Institute of Standards and Technology, 2010. p. 1.

CUMANI, S.; BRÜMMER, N.; BURGET, L.; LAFACE, P. Fast Discriminative Speaker Verification in the I-Vector Space. Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011. p. 4852. ISBN: 978-1-4577-0537-3.

KOCKMANN, M.; BURGET, L.; GLEMBEK, O.; FERRER, L.; ČERNOCKÝ, J. Prosodic Speaker Verification using Subspace Multinomial Models with Intersession Compensation. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba, Japan: International Speech Communication Association, 2010. iss. 9, p. 1061. ISBN: 978-1-61782-123-3. ISSN: 1990-9772.

KOCKMANN, M.; BURGET, L.; ČERNOCKÝ, J. Brno University of Technology System for Interspeech 2010 Paralinguistic Challenge. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010. iss. 9, p. 2822. ISBN: 978-1-61782-123-3. ISSN: 1990-9772.

BURGET, L.; MATĚJKA, P.; HUBEIKA, V.; ČERNOCKÝ, J. Investigation into variants of Joint Factor Analysis for speaker recognition. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. iss. 9, p. 1263. ISBN: 978-1-61567-692-7. ISSN: 1990-9772.

KOMBRINK, S.; MIKOLOV, T.; KARAFIÁT, M.; BURGET, L. Recurrent Neural Network based Language Modeling in Meeting Recognition. Proceedings of Interspeech 2011. Proceedings of Interspeech. Florence: International Speech Communication Association, 2011. iss. 8, p. 2877. ISBN: 978-1-61839-270-1. ISSN: 1990-9772.

BOŘIL, H.; GRÉZL, F.; HANSEN, J. Front-End Compensation Methods for LVCSR Under Lombard Effect. Proceedings of Interspeech 2011. Proceedings of Interspeech. Florence: International Speech Communication Association, 2011. iss. 8, p. 1257. ISBN: 978-1-61839-270-1. ISSN: 1990-9772.

VESELÝ, K. Parallel training of neural networks for speech recognition. Proceedings of the 16th Conference STUDENT EEICT 2010. Volume 3. Brno: Brno University of Technology, 2010. p. 74. ISBN: 978-80-214-4078-4.

Link

http://noel.feld.cvut.cz/gacr0811/cz/abstract/abstract.php

Responsible person: Pollák Petr
