Přístupnostní navigace
E-přihláška
Vyhledávání Vyhledat Zavřít
Detail publikačního výsledku
LANDINI, F.; WANG, S.; DIEZ SÁNCHEZ, M.; BURGET, L.; MATĚJKA, P.; ŽMOLÍKOVÁ, K.; MOŠNER, L.; SILNOVA, A.; PLCHOT, O.; NOVOTNÝ, O.; ZEINALI, H.; ROHDIN, J.
Originální název
But System for the Second Dihard Speech Diarization Challenge
Anglický název
Druh
Stať ve sborníku v databázi WoS či Scopus
Originální abstrakt
This paper describes the winning systems developed by theBUT team for the four tracks of the Second DIHARD SpeechDiarization Challenge. For tracks 1 and 2 the systems weremainly based on performing agglomerative hierarchical clustering(AHC) of x-vectors, followed by another x-vectorclustering based on Bayes hidden Markov model and variationalBayes inference. We provide a comparison of theimprovement given by each step and share the implementationof the core of the system. For tracks 3 and 4 withrecordings from the Fifth CHiME Challenge, we exploreddifferent approaches for doing multi-channel diarization andour best performance was obtained when applying AHC onthe fusion of per channel probabilistic linear discriminantanalysis scores.
Anglický abstrakt
Klíčová slova
Speaker Diarization, Variational Bayes, HMM, DIHARD, CHiME
Klíčová slova v angličtině
Autoři
Rok RIV
2021
Vydáno
04.05.2020
Nakladatel
IEEE Signal Processing Society
Místo
Barcelona
ISBN
978-1-5090-6631-5
Kniha
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Strany od
6529
Strany do
6533
Strany počet
5
URL
https://ieeexplore.ieee.org/document/9054251
Plný text v Digitální knihovně
http://hdl.handle.net/
BibTex
@inproceedings{BUT163962, author="Federico Nicolás {Landini} and Shuai {Wang} and Mireia {Diez Sánchez} and Lukáš {Burget} and Pavel {Matějka} and Kateřina {Žmolíková} and Ladislav {Mošner} and Anna {Silnova} and Oldřich {Plchot} and Ondřej {Novotný} and Hossein {Zeinali} and Johan Andréas {Rohdin}", title="But System for the Second Dihard Speech Diarization Challenge", booktitle="ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings", year="2020", pages="6529--6533", publisher="IEEE Signal Processing Society", address="Barcelona", doi="10.1109/ICASSP40776.2020.9054251", isbn="978-1-5090-6631-5", url="https://ieeexplore.ieee.org/document/9054251" }
Dokumenty
landini_icassp2020_09054251