Přístupnostní navigace
E-application
Search Search Close
Publication result detail
STAFYLAKIS, T.; SILNOVA, A.; ROHDIN, J.; PLCHOT, O.; BURGET, L.
Original Title
Challenging margin-based speaker embedding extractors by using the variational information bottleneck
English Title
Type
Paper in proceedings (conference paper)
Original Abstract
Speaker embedding extractors are typically trained using a classification loss over the training speakers. During the last few years, the standard softmax/cross-entropy loss has been replaced by the margin-based losses, yielding significant im- provements in speaker recognition accuracy. Motivated by the fact that the margin merely reduces the logit of the target speaker during training, we consider a probabilistic framework that has a similar effect. The variational information bottle- neck provides a principled mechanism for making deterministic nodes stochastic, resulting in an implicit reduction of the pos- terior of the target speaker. We experiment with a wide range of speaker recognition benchmarks and scoring methods and re- port competitive results to those obtained with the state-of-the- art Additive Angular Margin loss.
English abstract
Keywords
speaker recognition, variational information bottleneck
Key words in English
Authors
RIV year
2025
Released
01.09.2024
Publisher
International Speech Communication Association
Location
Kos
Book
Proceedings of Interspeech 2024
ISBN
1990-9772
Periodical
Proceedings of Interspeech
Volume
2024
Number
9
State
French Republic
Pages from
3220
Pages to
3224
Pages count
5
URL
https://www.isca-archive.org/interspeech_2024/stafylakis24_interspeech.pdf
BibTex
@inproceedings{BUT193738, author="Themos {Stafylakis} and Anna {Silnova} and Johan Andréas {Rohdin} and Oldřich {Plchot} and Lukáš {Burget}", title="Challenging margin-based speaker embedding extractors by using the variational information bottleneck", booktitle="Proceedings of Interspeech 2024", year="2024", journal="Proceedings of Interspeech", volume="2024", number="9", pages="3220--3224", publisher="International Speech Communication Association", address="Kos", doi="10.21437/Interspeech.2024-2058", issn="1990-9772", url="https://www.isca-archive.org/interspeech_2024/stafylakis24_interspeech.pdf" }
Documents
stafylakis_interspeech_2024