Project detail

Modern methods for processing, analysis, and visualization of multimedia and 3D data

Project period: 1.3.2020 - 28.2.2023

Funding sources

Brno University of Technology - BUT internal projects

About the project

Multimedia and 3D data are essential to many applications of modern computer systems, where their use is irreplaceable. It has also long been known that processing such data is difficult and computationally demanding, and the same holds for their visualization and, in particular, their analysis. Research in this area is therefore among the more difficult and, given its application potential, also among the most important research directions. The proposed project builds on the earlier project "Zpracování, analýza a zobrazování multimediálních a 3D dat" (Processing, analysis, and visualization of multimedia and 3D data).

Project code

FIT-S-20-6460

Original language

Czech

Investigators

Zemčík Pavel, prof. Dr. Ing., dr. h. c. - principal investigator
Ali Anas - co-investigator
Bambušek Daniel, Ing. - co-investigator
Bartl Vojtěch, Ing., Ph.D. - co-investigator
Baskar Murali Karthick, Ing., Ph.D. - co-investigator
Behúň Kamil, Ing. - co-investigator
Beneš Karel, Ing., Ph.D. - co-investigator
Beran Vítězslav, doc. Ing., Ph.D. - co-investigator
Bobák Petr, Ing., Ph.D. - co-investigator
Brejcha Jan, Ing., Ph.D. - co-investigator
Burget Lukáš, doc. Ing., Ph.D. - co-investigator
Čadík Martin, doc. Ing., Ph.D. - co-investigator
Černocký Jan, prof. Dr. Ing. - co-investigator
Dobeš Petr, Ing. - co-investigator
Dočekal Martin, Ing. - co-investigator
Doležal Jan, Ing. - co-investigator
Egorova Ekaterina, Ing., Ph.D. - co-investigator
Fajčík Martin, Ing., Ph.D. - co-investigator
Hanzlíček Jiří, Ing. - co-investigator
Herout Adam, prof. Ing., Ph.D. - co-investigator
Chlubna Tomáš, Ing., Ph.D. - co-investigator
Chudý Peter, doc. Ing., Ph.D., MBA - co-investigator
Jon Josef, Ing. - co-investigator
Káčerik Martin, Ing. - co-investigator
Kapinus Michal, Ing., Ph.D. - co-investigator
Kišš Martin, Ing. - co-investigator
Klepárník Petr, Ing., Ph.D. - co-investigator
Klíma Ondřej, Ing., Ph.D. - co-investigator
Kobrtek Jozef, Ing., Ph.D. - co-investigator
Kocour Martin, Ing. - co-investigator
Kodym Oldřich, Ing., Ph.D. - co-investigator
Kohút Jan, Ing. - co-investigator
Kolář Martin, M.Sc., Ph.D. et Ph.D. - co-investigator
Kula Michal, Ing., Ph.D. - co-investigator
Landini Federico Nicolás, Ph.D. - co-investigator
Lysek Tomáš, Ing. - co-investigator
Maršík Lukáš, Ing. - co-investigator
Matýšek Michal, Ing. - co-investigator
Milet Tomáš, Ing., Ph.D. - co-investigator
Mlích Jozef, Ing. - co-investigator
Mošner Ladislav, Ing. - co-investigator
Musil Marek, Ing. - co-investigator
Musil Martin, Ing., Ph.D. - co-investigator
Musil Petr, Ing., Ph.D. - co-investigator
Najman Pavel, Ing. - co-investigator
Nosko Svetozár, Ing., Ph.D. - co-investigator
Novotný Ondřej, Ing., Ph.D. - co-investigator
Ondel Lucas Antoine Francois, Mgr., Ph.D. - co-investigator
Polášek Tomáš, Ing. - co-investigator
Pulugundla Bhargav, M.Sc. - co-investigator
Silnova Anna, M.Sc., Ph.D. - co-investigator
Skácel Miroslav, Ing. - co-investigator
Smrž Pavel, doc. RNDr., Ph.D. - co-investigator
Španěl Michal, doc. Ing., Ph.D. - co-investigator
Špaňhel Jakub, Ing., Ph.D. - co-investigator
Švec Ján, Ing. - co-investigator
Švec Tomáš, Ing. - co-investigator
Tomešek Jan, Ing. - co-investigator
Tóth Michal, Ing. - co-investigator
Veľas Martin, Ing., Ph.D. - co-investigator
Vlk Jan, Ing., Ph.D. - co-investigator
Vlnas Michal, Ing. - co-investigator
Yusuf Bolaji - co-investigator
Žmolíková Kateřina, Ing., Ph.D. - co-investigator

Units

Department of Computer Graphics and Multimedia
- internal (1.1.2020 - 31.12.2022)
Faculty of Information Technology
- recipient (1.1.2020 - 31.12.2022)

Results

FAJČÍK, M.; MOTLÍČEK, P.; SMRŽ, P.: EventCausalityId; IDIAPERS @ CASE22-TASK 3: Event Causality Identification. URL: https://github.com/idiap/cncsharedtask. (Software)

BERAN, V.; BAMBUŠEK, D.; HUBINÁK, R.; SEDLMAJER, K.: DROCO; DroCo - Multi-Drone Control Vizualization Tool. URL: https://github.com/robofit/drone_vstool. (Software)

HRADIŠ, M.; KIŠŠ, M.; KODYM, O.; KOHÚT, J.; BENEŠ, K.; BUCHAL, P.: PERO-OCR-PRINT; Software pro adaptabilní rozpoznávání textu starých tisků. https://github.com/DCGM/pero-ocr, pip https://pypi.org/project/pero-ocr/. URL: https://www.fit.vut.cz/research/product/666/. (Software)

JON, J.; FAJČÍK, M.; DOČEKAL, M.; SMRŽ, P.: CSR20; Official implementation of BUT-FIT's solution from SemEval-2020 Task 4: Commonsense Validation and Explanation. URL: https://github.com/cepin19/semeval2020_task4. (Software)

BREJCHA, J.; ČADÍK, M.: LandscapeAR; LandscapeAR. URL: https://github.com/brejchajan/LandscapeAR. (Software)

JURÁNEK, R.; JURÁNKOVÁ, M.: PCLines-Py; PCLines package for Python. URL: https://github.com/RomanJuranek/pclines-python. (Software)

BAŘINA, D.: dwt-denoise; Image Denoising Software. URL: http://www.fit.vutbr.cz/research/prod/?id=648. (Software)

HRADIŠ, M.; KIŠŠ, M.; KOHÚT, J.; BENEŠ, K.; KOSTELNÍK, M.: PERO-INDEXER; Software pro extrakci informace z polostrukturovaných dokumentů. https://github.com/DCGM/pero-indexer, pip https://pypi.org/project/pero-indexer/. URL: https://www.fit.vut.cz/research/product/755/. (Software)

BREJCHA, J.; ČADÍK, M.: ITR; Immersive Trip Reports. URL: https://github.com/brejchajan/itr. (Software)

HRADIŠ, M.; KIŠŠ, M.; KOHÚT, J.; BENEŠ, K.; KODYM, O.; BUCHAL, P.; HŘÍBEK, D.: PERO-OCR-HWR; Interaktivní polo-automatické rozpoznávání ručně psaného písma. URL: https://github.com/DCGM/pero_ocr_web. (Software)

FAJČÍK, M.; JON, J.; DOČEKAL, M.; SMRŽ, P.: CR20; Automatic counterfactual reasoning tool from SemEval2020. URL: https://github.com/MFajcik/SemEval_2020_Task-5. (Software)

DIEZ SÁNCHEZ, M.; LANDINI, F.; BURGET, L.: x-vectors Diarization (aka VBx); Bayesian HMM based x-vector clustering - VBx. URL: https://github.com/BUTSpeechFIT/VBx. (Software)

ČADÍK, M.; POLÁŠEK, T.: ICTree; ICTree: Automatic Perceptual Metric for Tree Models. URL: http://cphoto.fit.vutbr.cz/ictree/. (Software)

JURÁNKOVÁ, M.; JURÁNEK, R.: DiamondSpace-Py; DiamondSpace package for Python. URL: https://github.com/MarketaJu/diamond_space. (Software)

JURÁNEK, R.: LIBRECTIFY-CPP; librectify - library for automatic perspective correction of images. URL: https://github.com/RomanJuranek/librectify. (Software)

HELMKE, H.; KLEINERT, M.; SHETTY, S.; OHNEISER, O.; EHR, H.; PRASAD, A.; MOTLÍČEK, P.; VESELÝ, K.; ONDŘEJ, K.; SMRŽ, P.; HARFMANN, J.; WINDISCH, C. Readback Error Detection by Automatic Speech Recognition to Increase ATM Safety. In Proceedings of ATM Seminar. on-line: EUROPEAN ORGANISATION FOR THE SAFETY OF AIR NAVIGATION, 2021. p. 1-10.

SUBRAMANIAN, A.; WANG, X.; BASKAR, M.; WATANABE, S.; TANIGUCHI, T.; TRAN, D.; FUJITA, Y. Speech Enhancement Using End-to-End Speech Recognition Objectives. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. New Paltz, NY: IEEE Signal Processing Society, 2019. p. 234-238. ISBN: 978-1-7281-1123-0.

KLEINERT, M.; HELMKE, H.; SHETTY, S.; OHNEISER, O.; EHR, H.; PRASAD, A.; MOTLÍČEK, P.; HARFMANN, J. Automated Interpretation of Air Traffic Control Communication: The Journey from Spoken Words to a Deeper Understanding of the Meaning. In Proceedings of DASC 2021. San Antonio, Texas: Institute of Electrical and Electronics Engineers, 2021. p. 1-9. ISBN: 978-1-6654-3420-1.

BARTL, V.; JURÁNEK, R.; ŠPAŇHEL, J.; HEROUT, A. PlaneCalib: Automatic Camera Calibration by Multiple Observations of Rigid Objects on Plane. In 2020 International Conference on Digital Image Computing: Techniques and Applications (DICTA). Melbourne: Institute of Electrical and Electronics Engineers, 2020. p. 1-8. ISBN: 978-1-7281-9108-9.

KONÍKOVÁ, L.; KRÁLÍK, M.; KLÍMA, O.; ČUTA, M. Does parental similarity degree affect the development of their offspring?. Anthropologia integra, 2022, vol. 13, no. 1, p. 15-29. ISSN: 1804-6657.

POLCEROVÁ, L.; CHOVANCOVÁ, M.; KRÁLÍK, M.; BEŇUŠ, R.; KLÍMA, O.; MEINEROVÁ, T.; ČUTA, M.; PETROVÁ, M. Radioulnar Contrasts in Fingerprint Ridge Counts: Searching for Dermatoglyphic Markers of Early Sex Development. AMERICAN JOURNAL OF HUMAN BIOLOGY, 2022, vol. 34, no. 5, p. 1-15. ISSN: 1520-6300.

KAPINUS, M.; MATERNA, Z.; BAMBUŠEK, D.; BERAN, V. End-User Robot Programming Case Study: Augmented Reality vs. Teach Pendant. In Companion of the 2020 ACM/IEEE International Conference on Human-Robot Interaction. Cambridge: Association for Computing Machinery, 2020. p. 281-283. ISBN: 978-1-4503-7057-8.

BAŘINA, D. Convergence verification of the Collatz problem. JOURNAL OF SUPERCOMPUTING, 2021, vol. 77, no. 3, p. 2681-2688. ISSN: 1573-0484.

ZEINALI, H.; LEE, K.; ALAM, J.; BURGET, L. SdSV Challenge 2020: Large-Scale Evaluation of Short-duration Speaker Verification. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Shanghai: International Speech Communication Association, 2020. no. 10, p. 731-735. ISSN: 1990-9772.

SCHARENBORG, O.; BESACIER, L.; BLACK, A.; HASEGAWA-JOHNSON, M.; METZE, F.; NEUBIG, G.; STÜKER, S.; GODARD, P.; MÜLLER, M.; ONDEL YANG, L.; PALASKAR, S.; ARTHUR, P.; CIANNELLA, F.; DU, M.; LARSEN, E.; MERKX, D.; RIAD, R.; WANG, L.; DUPOUX, E. Speech Technology for Unwritten Languages. IEEE-ACM Transactions on Audio Speech and Language Processing, 2020, vol. 2020, no. 28, p. 964-975. ISSN: 2329-9290.

BAŘINA, D.; KLÍMA, O. x3: Lossless Data Compressor. Data Compression Conference 2022. Washington: IEEE Computer Society, 2022. p. 441-441.

BAŘINA, D.; ŠOLONY, M.; CHLUBNA, T.; DLABAJA, D.; KLÍMA, O.; ZEMČÍK, P. Comparison of light field compression methods. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, vol. 81, no. 2, p. 2517-2528. ISSN: 1573-7721.

WANG, S.; ROHDIN, J.; PLCHOT, O.; BURGET, L.; YU, K.; ČERNOCKÝ, J. Investigation of Specaugment for Deep Speaker Embedding Learning. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020. p. 7139-7143. ISBN: 978-1-5090-6631-5.

MOŠNER, L.; PLCHOT, O.; ROHDIN, J.; ČERNOCKÝ, J. Utilizing VOiCES dataset for multichannel speaker verification with beamforming. Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop. Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland. Tokyo: International Speech Communication Association, 2020. no. 11, p. 187-193. ISSN: 2312-2846.

KUBÍK, T.; ŠPANĚL, M. Robust Teeth Detection in 3D Dental Scans by Automated Multi-View Landmarking. In 15th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2022). Vienna: Institute for Systems and Technologies of Information, Control and Communication, 2022. p. 24-34. ISBN: 978-989-758-552-4.

SOPHOCLEOUS, M.; LESSI, C.; XU, Z.; ŠPAŇHEL, J.; QIU, R.; LENDINEZ, A.; CHONDROULIS, I.; BELIKAIDIS, I. AI-driven intent-based networking for 5G enhanced robot autonomy. In AIAI 2022 IFIP WG 12.5 International Workshops. IFIP Advances in Information and Communication Technology. Cham: Springer Nature Switzerland AG, 2022. no. 2022, p. 61-70. ISSN: 1868-422X.

BARTL, V.; ŠPAŇHEL, J.; HEROUT, A. PersonGONE: Image Inpainting for Automated Checkout Solution. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. New Orleans, LA: IEEE Computer Society, 2022. no. 7, p. 3114-3122. ISSN: 2160-7516.

POLÁŠEK, T.; HRŮŠA, D.; BENEŠ, B.; ČADÍK, M. ICTree: Automatic Perceptual Metrics for Tree Models. ACM TRANSACTIONS ON GRAPHICS, 2021, vol. 40, no. 6, p. 1-15. ISSN: 0730-0301.

BAŘINA, D.; KLÍMA, O. Region of interest in JPEG. In WSCG 2022 Proceedings. Computer Science Research Notes. Plzeň: Union Agency, 2022. p. 1-4. ISBN: 978-80-86943-64-0.

BUREŠ, J.; EEROLA, T.; LENSU, L.; KÄLVIÄINEN, H.; ZEMČÍK, P. Plankton Recognition in Images with Varying Size. Lecture Notes in Computer Science, 2021, vol. 12666, no. 2, p. 110-120. ISSN: 0302-9743.

MADEJA, R.; BAJOR, G.; KLÍMA, O.; BIALY, L.; POMETLOVÁ, J. Computer-assisted preoperative planning of reduction of and osteosynthesis of scapular fracture: A case report. Open Medicine, 2021, vol. 16, no. 1, p. 1597-1601. ISSN: 2391-5463.

BAŘINA, D.; KLÍMA, O. JPEG 2000: Guide for Digital Libraries. Digital Library Perspectives, 2020, vol. 36, no. 3, p. 249-263. ISSN: 2059-5816.

KOŠŤÁL, M.; NOSEK, V.; KLÍMA, O. Morfologická analýza nádob KPT prostřednictvím 3D modelu. Bořetice: 2021. 10 p.

KRÁLÍK, M.; KONÍKOVÁ, L.; ARSLAN, A.; POLCEROVÁ, L.; ČUTA, M.; HLOŽEK, M.; KLÍMA, O. Age, sex and positional variations in the human epidermal ridge breadth by multiple measurements on a crosssectional sample of schoolage children. Anthropologie, 2022, vol. 60, no. 2, p. 379-402. ISSN: 2570-9127.

PEČIVA, J. Tutoriál Vulkan 2022. ROOT, informace nejen ze světa Linuxu, 2022, vol. 2022, no. 1, p. 9-13. ISSN: 1212-8309.

LANDINI, F.; WANG, S.; DIEZ SÁNCHEZ, M.; BURGET, L.; MATĚJKA, P.; ŽMOLÍKOVÁ, K.; MOŠNER, L.; SILNOVA, A.; PLCHOT, O.; NOVOTNÝ, O.; ZEINALI, H.; ROHDIN, J. But System for the Second Dihard Speech Diarization Challenge. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020. p. 6529-6533. ISBN: 978-1-5090-6631-5.

BASKAR, M.; ROSENBERG, A.; RAMABHADRAN, B.; ZHANG, Y. Reducing Domain mismatch in Self-supervised speech pre-training. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Incheon: International Speech Communication Association, 2022. no. 9, p. 3028-3032. ISSN: 1990-9772.

HELMKE, H.; SHETTY, S.; KLEINERT, M.; OHNEISER, O.; EHR, H.; MOTLÍČEK, P.; PRASAD, A.; WINDISCH, C. Measuring Speech Recognition And Understanding Performance in Air Traffic Control Domain Beyond Word Error Rates. In Proceedings of 11th SESAR Innovation Days 2021. Belgium: 2021. p. 1-8.

ŽMOLÍKOVÁ, K.; KOCOUR, M.; LANDINI, F.; BENEŠ, K.; KARAFIÁT, M.; VYDANA, H.; LOZANO DÍEZ, A.; PLCHOT, O.; BASKAR, M.; ŠVEC, J.; MOŠNER, L.; MALENOVSKÝ, V.; BURGET, L.; YUSUF, B.; NOVOTNÝ, O.; GRÉZL, F.; SZŐKE, I.; ČERNOCKÝ, J. BUT System for CHiME-6 Challenge. Proceedings of CHiME 2020 Virtual Workshop. Barcelona: University of Sheffield, 2020. p. 1-3.

KOHÚT, J.; HRADIŠ, M. TS-Net: OCR Trained to Switch Between Text Transcription Styles. In Lladós J., Lopresti D., Uchida S. (eds) Document Analysis and Recognition - ICDAR 2021. Lecture Notes in Computer Science. Lausanne: Springer Nature Switzerland AG, 2021. no. 1, p. 478-493. ISBN: 978-3-030-86336-4. ISSN: 0302-9743.

PEČIVA, J. Tutoriál Vulkan 2021. ROOT, informace nejen ze světa Linuxu, 2021, vol. 2021, no. 1, p. 1-8. ISSN: 1212-8309.

KOCOUR, M.; VESELÝ, K.; SZŐKE, I.; KESIRAJU, S.; ZULUAGA-GOMEZ, J.; BLATT, A.; PRASAD, A.; NIGMATULINA, I.; MOTLÍČEK, P.; KLAKOW, D.; TART, A.; KOLČÁREK, P.; ČERNOCKÝ, J.; CEVENINI, C.; CHOUKRI, K.; RIGAULT, M.; LANDIS, F.; SARFJOO, S. Automatic Processing Pipeline for Collecting and Annotating Air-Traffic Voice Communication Data. In Proceedings of 9th OpenSky Symposium 2021, OpenSky Network, Brussels, Belgium. Proceedings. Brussels: MDPI, 2021. no. 12, p. 1-10. ISSN: 2504-3900.

GAJDOŠECH, L.; KOCUR, V.; STUCHLÍK, M.; HUDEC, L.; MADARAS, M. Towards Deep Learning-based 6D Bin Pose Estimation in 3D Scans. In Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4 VISAPP: VISAPP. Setubal: SciTePress - Science and Technology Publications, 2022. p. 545-552. ISBN: 978-989-758-555-5.

BAŘINA, D. 7x +- 1: Close Relative of the Collatz Problem. Computational Methods in Science and Technology, 2022, vol. 28, no. 4, p. 143-147. ISSN: 1505-0602.

ONDEL YANG, L.; LAM-YEE-MUI, L.; KOCOUR, M.; CORRO, C.; BURGET, L. GPU-Accelerated Forward-Backward Algorithm with Application to Lattice-Free MMI. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022. p. 8417-8421. ISBN: 978-1-6654-0540-9.

KODYM, O.; HRADIŠ, M. TG2: text-guided transformer GAN for restoring document readability and perceived quality. International Journal on Document Analysis and Recognition, 2021, vol. 2021, no. 1, p. 1-14. ISSN: 1433-2825.

ČUTA, M.; POLCEROVÁ, L.; KLÍMA, O.; ŠKULTÉTYOVÁ, A.; ZEMČÍK, P.; KRÁLÍK, M. Relationship between Height Growth in Adolescence and Dermatoglyphic Radioulnar Ridge Count Contrasts in the Children and Their Mothers. Anthropologie, 2022, vol. 60, no. 2, p. 1-10. ISSN: 2570-9127.

DUNBAR, E.; KARADAYI, J.; BERNARD, M.; CAO, X.; ALGAYRES, R.; ONDEL YANG, L.; BESACIER, L.; SAKTI, S.; DUPOUX, E. The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Shanghai: International Speech Communication Association, 2020. no. 10, p. 4831-4835. ISSN: 1990-9772.

KAPINUS, M.; BAMBUŠEK, D.; MATERNA, Z.; BERAN, V.; SMRŽ, P. Improved Indirect Virtual Objects Selection Methods for Cluttered Augmented Reality Environments on Mobile Devices. In HRI '22: Proceedings of the 2022 ACM/IEEE International Conference on Human-Robot Interaction. Sapporo, Hokkaido: Association for Computing Machinery, 2022. p. 834-838. ISBN: 978-1-6654-0731-1.

TOMEŠEK, J.; ČADÍK, M.; BREJCHA, J. CrossLocate: Cross-Modal Large-Scale Visual Geo-Localization in Natural Environments using Rendered Modalities. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV). Waikoloa: Institute of Electrical and Electronics Engineers, 2022. p. 2193-2202. ISBN: 978-1-6654-0477-8.

SILNOVA, A.; BRUMMER, J.; ROHDIN, J.; STAFYLAKIS, T.; BURGET, L. Probabilistic embeddings for speaker diarization. Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop. Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland. Tokyo: International Speech Communication Association, 2020. no. 11, p. 24-31. ISSN: 2312-2846.

BAŘINA, D. Comparison of Lossless Image Formats. In WSCG 2021 Proceedings. Computer Science Research Notes. Plzeň: Union Agency, 2021. p. 339-342. ISBN: 978-80-86943-34-3.

YUSUF, B.; GOK, A.; GUNDOGDU, B.; SARAÇLAR, M. End-to-End Open Vocabulary Keyword Search. In Proceedings Interspeech 2021. Proceedings of Interspeech. Brno: International Speech Communication Association, 2021. no. 8, p. 4388-4392. ISSN: 1990-9772.

HAVRÁNEK, P.; ZŮVALA, R.; ŠPAŇHEL, J.; HEROUT, A.; VALENTOVÁ, V.; AMBROS, J. How does road marking in horizontal curves influence driving behaviour?. European Transport Research Review, 2020, vol. 12, no. 1, p. 1-11. ISSN: 1866-8887.

BREJCHA, J.; LUKÁČ, M.; HOLD-GEOFFROY, Y.; WANG, O.; ČADÍK, M. LandscapeAR: Large Scale Outdoor Augmented Reality by Matching Photographs with Terrain Models Using Learned Descriptors. In Computer Vision - ECCV 2020. Lecture Notes in Computer Science. Cham: Springer Nature Switzerland AG, 2020. p. 295-312. ISBN: 978-3-030-58525-9.

YUSUF, B.; GANDHE, A.; SOKOLOV, A. Usted: Improving ASR with a Unified Speech and Text Encoder-Decoder. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022. p. 8297-8301. ISBN: 978-1-6654-0540-9.

HEROUT, A.; KODYM, O.; ŠPANĚL, M. AutoImplant 2020-First MICCAI Challenge on Automatic Cranial Implant Design. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, vol. 40, no. 9, p. 2329-2342. ISSN: 0278-0062.

FOLENTA, J.; ŠPAŇHEL, J.; BARTL, V.; HEROUT, A. Determining Vehicle Turn Counts at Multiple Intersections by Separated Vehicle Classes Using CNNs. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. Seattle, WA: IEEE Computer Society, 2020. no. 07, p. 2544-2549. ISBN: 978-1-7281-9360-1. ISSN: 2160-7516.

KOSIBA, M.; BURGET, L. Multiwavelength classification of X-ray selected galaxy cluster candidates using convolutional neural networks. Monthly Notices of the Royal Astronomical Society, 2020, vol. 496, no. 4, p. 4141-4153. ISSN: 1365-2966.

LOZANO DÍEZ, A.; SILNOVA, A.; PULUGUNDLA, B.; ROHDIN, J.; VESELÝ, K.; BURGET, L.; PLCHOT, O.; GLEMBEK, O.; NOVOTNÝ, O.; MATĚJKA, P. BUT Text-Dependent Speaker Verification System for SdSV Challenge 2020. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Shanghai: International Speech Communication Association, 2020. no. 10, p. 761-765. ISSN: 1990-9772.

MLÍCH, J.; KOPLÍK, K.; HRADIŠ, M.; ZEMČÍK, P. Fire Segmentation in Still Images. In Springer International Publishing. Lecture Notes in Computer Science. Auckland: Springer International Publishing, 2020. p. 27-37. ISBN: 978-3-030-40605-9.

BOBÁK, P.; ČMOLÍK, L.; ČADÍK, M. Temporally Stable Boundary Labeling for Interactive and Non-Interactive Dynamic Scenes. COMPUTERS & GRAPHICS-UK, 2020, vol. 91, no. 10, p. 265-278. ISSN: 0097-8493.

BAŘINA, D. Multiplication Algorithm Based on Collatz function. THEORY OF COMPUTING SYSTEMS, 2020, vol. 64, no. 8, p. 1331-1337. ISSN: 1433-0490.

ZULUAGA-GOMEZ, J.; MOTLÍČEK, P.; ZHAN, Q.; VESELÝ, K.; BRAUN, R. Automatic Speech Recognition Benchmark for Air-Traffic Communications. In Proceedings of Interspeech 2020. Proceedings of Interspeech. Shanghai: International Speech Communication Association, 2020. no. 10, p. 2297-2301. ISSN: 1990-9772.

KODYM, O.; ŠPANĚL, M.; HEROUT, A. Segmentation of Defective Skulls from CT Data for Tissue Modelling. In Towards the Automatization of Cranial Implant Design in Cranioplasty II. Lecture Notes in Computer Science. Strasbourg: Springer Nature Switzerland AG, 2021. no. 13123, p. 19-28. ISBN: 978-3-030-92652-6. ISSN: 0302-9743.

KODYM, O.; ŠPANĚL, M.; HEROUT, A. Deep Learning for Cranioplasty in Clinical Practice: Going from Synthetic to Real Patient Data. COMPUTERS IN BIOLOGY AND MEDICINE, 2021, vol. 137, no. 104766, p. 1-10. ISSN: 0010-4825.

DVOŘÁKOVÁ, M.; HRADIŠ, M.; ŽABIČKA, P.; KOHÚT, J.; KIŠŠ, M.; BENEŠ, K. Využití PERO OCR při přepisu rukopisů. Archivní časopis, 2022, vol. 72, no. 1, p. 14-27. ISSN: 0004-0398.

KIŠŠ, M.; KOHÚT, J.; BENEŠ, K.; HRADIŠ, M. Importance of Textlines in Historical Document Classification. In Uchida, S., Barney, E., Eglin, V. (eds) Document Analysis Systems. Lecture Notes in Computer Science. La Rochelle: Springer Nature Switzerland AG, 2022. p. 158-170. ISBN: 978-3-031-06554-5.

KOCOUR, M.; VESELÝ, K.; BLATT, A.; ZULUAGA-GOMEZ, J.; SZŐKE, I.; ČERNOCKÝ, J.; KLAKOW, D.; MOTLÍČEK, P. Boosting of Contextual Information in ASR for Air-Traffic Call-Sign Recognition. In Proceedings Interspeech 2021. Proceedings of Interspeech. Brno: International Speech Communication Association, 2021. no. 8, p. 3301-3305. ISSN: 1990-9772.

KLEPÁRNÍK, P.; ZEMČÍK, P.; TREEBY, B.; JAROŠ, J. On-the-Fly Calculation of Time-Averaged Acoustic Intensity in Time-Domain Ultrasound Simulations Using a k-Space Pseudospectral Method. IEEE TRANSACTIONS ON ULTRASONICS FERROELECTRICS AND FREQUENCY CONTROL, 2022, vol. 69, no. 10, p. 2917-2929. ISSN: 1525-8955.

BAŘINA, D. Real-time wavelet transform for infinite image strips. Journal of Real-Time Image Processing, 2021, vol. 18, no. 3, p. 585-591. ISSN: 1861-8200.

BARTL, V.; ŠPAŇHEL, J.; DOBEŠ, P.; JURÁNEK, R.; HEROUT, A. Automatic Camera Calibration by Landmarks on Rigid Objects. Machine vision and applications, 2020, vol. 32, no. 1, p. 2-15. ISSN: 1432-1769.

DOBEŠ, P.; ŠPAŇHEL, J.; BARTL, V.; JURÁNEK, R.; HEROUT, A. Density-Based Vehicle Counting with Unsupervised Scale Selection. In Digital Image Computing: Techniques and Applications 2020. Melbourne: Institute of Electrical and Electronics Engineers, 2020. p. 1-8. ISBN: 978-1-7281-9108-9.

FAJČÍK, M.; DOČEKAL, M.; JON, J.; SMRŽ, P. BUT-FIT at SemEval-2020 Task 5: Automatic detection of counterfactual statements with deep pre-trained language representation models. In Proceedings of the Fourteenth Workshop on Semantic Evaluation. Barcelona (online): Association for Computational Linguistics, 2020. p. 437-444. ISBN: 978-1-952148-31-6.

RAJASEKARAN, S.; KANG, H.; ČADÍK, M.; GALIN, E.; GUÉRIN, E.; PEYTAVIE, A.; SLAVÍK, P.; BENEŠ, B. PTRM: Perceived Terrain Realism Metric. ACM Transactions on Applied Perception, 2022, vol. 19, no. 2, p. 1-22. ISSN: 1544-3558.

KOBRTEK, J.; MILET, T.; TÓTH, M.; HEROUT, A. Comparison of Modern Omnidirectional Precise Shadowing Techniques Versus Ray Tracing. Computer Graphics Forum, 2022, vol. 41, no. 1, p. 106-121. ISSN: 0167-7055.

CHLUBNA, T.; MILET, T.; ZEMČÍK, P. Real-time per-pixel focusing method for light field rendering. Computational Visual Media, 2021, vol. 2021, no. 7, p. 319-333. ISSN: 2096-0662.

DELCROIX, M.; OCHIAI, T.; ŽMOLÍKOVÁ, K.; KINOSHITA, K.; TAWARA, N.; NAKATANI, T.; ARAKI, S. Improving Speaker Discrimination of Target Speech Extraction With Time-Domain Speakerbeam. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020. p. 691-695. ISBN: 978-1-5090-6631-5.

DIEZ SÁNCHEZ, M.; BURGET, L.; LANDINI, F.; WANG, S.; ČERNOCKÝ, J. Optimizing Bayesian Hmm Based X-Vector Clustering for the Second Dihard Speech Diarization Challenge. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020. p. 6519-6523. ISBN: 978-1-5090-6631-5.

ALAM, J.; BOULIANNE, G.; BURGET, L.; DAHMANE, M.; DIEZ SÁNCHEZ, M.; GLEMBEK, O.; LALONDE, M.; LOZANO DÍEZ, A.; MATĚJKA, P.; MIZERA, P.; MOŠNER, L.; NOISEUX, C.; MONTEIRO, J.; NOVOTNÝ, O.; PLCHOT, O.; ROHDIN, J.; SILNOVA, A.; SLAVÍČEK, J.; STAFYLAKIS, T.; ST-CHARLES, P.; WANG, S.; ZEINALI, H. Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge. Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop. Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland. Tokyo: International Speech Communication Association, 2020. no. 11, p. 289-295. ISSN: 2312-2846.

KESIRAJU, S.; PLCHOT, O.; BURGET, L.; GANGASHETTY, S. Learning Document Embeddings Along With Their Uncertainties. IEEE-ACM Transactions on Audio Speech and Language Processing, 2020, vol. 2020, no. 28, p. 2319-2332. ISSN: 2329-9290.

ZULUAGA-GOMEZ, J.; VESELÝ, K.; BLATT, A.; MOTLÍČEK, P.; KLAKOW, D.; TART, A.; SZŐKE, I.; PRASAD, A.; SARFJOO, S.; KOLČÁREK, P.; KOCOUR, M.; ČERNOCKÝ, J.; CEVENINI, C.; CHOUKRI, K.; RIGAULT, M.; LANDIS, F. Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications. Proceedings of the 8th OpenSky Symposium 2020. Proceedings. Brussels: MDPI, 2020. no. 59, p. 1-10. ISSN: 2504-3900.

MUSIL, M.; NOSKO, S.; ZEMČÍK, P. De-ghosted HDR video acquisition for embedded systems: Ghost-free HDR video of motion objects from stationary cameras. Journal of Real-Time Image Processing, 2020, vol. 2020, no. 1, p. 1-10. ISSN: 1861-8200.

KODYM, O.; HRADIŠ, M. Page Layout Analysis System for Unconstrained Historic Documents. In Lladós J., Lopresti D., Uchida S. (eds) Document Analysis and Recognition - ICDAR 2021. Lecture Notes in Computer Science. Lausanne: Springer Nature Switzerland AG, 2021. p. 492-506. ISBN: 978-3-030-86330-2.

BASKAR, M.; ROSENBERG, A.; RAMABHADRAN, B.; ZHANG, Y.; MORENO, P. Ask2Mask: Guided Data Selection for Masked Speech Modeling. IEEE Journal of Selected Topics in Signal Processing, 2022, vol. 16, no. 6, p. 1357-1366. ISSN: 1932-4553.

ČADÍK, M.; BREJCHA, J.; LUKÁČ, M.; CHEN, Z.; VUT, Adobe: Generating immersive trip photograph visualizations. US10825246B2, Patent. (2020)

WANNER, L.; KLUSCH, M.; MAVROPOULOS, A.; JAMIN, E.; MARIN PUCHADES, V.; CASAMAYOR, G.; ČERNOCKÝ, J.; EGOROVA, E. Towards a Versatile Intelligent Conversational Agent as Personal Assistant for Migrants. In The PAAMS Collection. PAAMS 2021: Advances in Practical Applications of Agents, Multi-Agent Systems, and Social Good. Lecture Notes in Computer Science. Lecture Notes in Computer Science book series. Salamanca: Springer International Publishing, 2021. no. 10, p. 316-327. ISBN: 978-3-030-85739-4. ISSN: 0302-9743.

JURÁNEK, R.; VÝRAVSKÝ, J.; KOLÁŘ, M.; MOTL, D.; ZEMČÍK, P. Graph-based deep learning segmentation of EDS spectral images for automated mineral phase analysis. COMPUTERS & GEOSCIENCES, 2022, vol. 165, no. 8, p. 1-2. ISSN: 0098-3004.

MATĚJKA, P.; PLCHOT, O.; GLEMBEK, O.; BURGET, L.; ROHDIN, J.; ZEINALI, H.; MOŠNER, L.; SILNOVA, A.; NOVOTNÝ, O.; DIEZ SÁNCHEZ, M.; ČERNOCKÝ, J. 13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE. COMPUTER SPEECH AND LANGUAGE, 2020, vol. 2020, no. 63, p. 1-15. ISSN: 0885-2308.

ZULUAGA-GOMEZ, J.; NIGMATULINA, I.; PRASAD, A.; MOTLÍČEK, P.; VESELÝ, K.; KOCOUR, M.; SZŐKE, I. Contextual Semi-Supervised Learning: An Approach to Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems. In Proceedings Interspeech 2021. Proceedings of Interspeech. Brno: International Speech Communication Association, 2021. no. 8, p. 3296-3300. ISSN: 1990-9772.