Přístupnostní navigace
E-přihláška
Vyhledávání Vyhledat Zavřít
Detail publikačního výsledku
LI, J.; MAK, M.; ROHDIN, J.; LEE, K.; HERMANSKY, H.
Originální název
Bayesian Learning for Domain-Invariant Speaker Verification and Anti-Spoofing
Anglický název
Druh
Stať ve sborníku v databázi WoS či Scopus
Originální abstrakt
The performance of automatic speaker verification (ASV) and anti-spoofing drops seriously under real-world domain mismatch conditions. The relaxed instance frequency-wise normalization (RFN), which normalizes the frequency components based on the feature statistics along the time and channel axes, is a promising approach to reducing the domain dependence in the feature maps of a speaker embedding network. We advocate that the different frequencies should receive different weights and that the weights' uncertainty due to domain shift should be accounted for. To these ends, we propose leveraging variational inference to model the posterior distribution of the weights, which results in Bayesian weighted RFN (BWRFN). This approach overcomes the limitations of fixed-weight RFN, making it more effective under domain mismatch conditions. Extensive experiments on cross-dataset ASV, cross-TTS anti-spoofing, and spoofing-robust ASV show that BWRFN is significantly better than WRFN and RFN.
Anglický abstrakt
Klíčová slova
anti-spoofing | Bayesian learning | domain generalization | speaker verification
Klíčová slova v angličtině
Autoři
Rok RIV
2026
Vydáno
17.08.2025
Nakladatel
International Speech Communication Association
Místo
Rotterdam
Kniha
Proceedings of the Annual Conference of the International Speech Communication Association Interspeech
Periodikum
Interspeech
Stát
Nizozemsko
Strany od
1123
Strany do
1127
Strany počet
5
URL
https://www.isca-archive.org/interspeech_2025/li25h_interspeech.pdf
BibTex
@inproceedings{BUT199931, author="{} and {} and Johan Andréas {Rohdin} and {} and Hynek {Heřmanský}", title="Bayesian Learning for Domain-Invariant Speaker Verification and Anti-Spoofing", booktitle="Proceedings of the Annual Conference of the International Speech Communication Association Interspeech", year="2025", journal="Interspeech", pages="1123--1127", publisher="International Speech Communication Association", address="Rotterdam", doi="10.21437/Interspeech.2025-655", url="https://www.isca-archive.org/interspeech_2025/li25h_interspeech.pdf" }
Dokumenty
Jin Li_2025h_interspeech