Publication result detail

Bayesian HMM based x-vector clustering for Speaker Diarization

DIEZ SÁNCHEZ, M.; BURGET, L.; WANG, S.; ROHDIN, J.; ČERNOCKÝ, J.

Original title

Bayesian HMM based x-vector clustering for Speaker Diarization

Type

Paper in proceedings indexed in WoS or Scopus

Original abstract

This paper presents a simplified version of the previously proposed diarization algorithm based on Bayesian Hidden Markov Models, which uses Variational Bayesian inference for very fast and robust clustering of x-vector (neural network based speaker embeddings). The presented results show that this clustering algorithm provides significant improvements in diarization performance as compared to the previously used Agglomerative Hierarchical Clustering. The output of this system can be further employed as an initialization for a second stage VB diarization system, using frame-wise MFCC features as input, to obtain optimal results.

Keywords

Speaker Diarization, Variational Bayes, HMM, x-vector, DIHARD

Authors

DIEZ SÁNCHEZ, M.; BURGET, L.; WANG, S.; ROHDIN, J.; ČERNOCKÝ, J.

RIV year

2020

Published

15.09.2019

Publisher

International Speech Communication Association

Place

Graz

Book

Proceedings of Interspeech

ISSN

1990-9772

Periodical

Proceedings of Interspeech

Volume

2019

Issue

9

Country

France

Pages from

346

Pages to

350

Page count

5

URL

BibTeX

@inproceedings{BUT159992,
  author="Mireia {Diez Sánchez} and Lukáš {Burget} and Shuai {Wang} and Johan Andréas {Rohdin} and Jan {Černocký}",
  title="Bayesian HMM based x-vector clustering for Speaker Diarization",
  booktitle="Proceedings of Interspeech",
  year="2019",
  journal="Proceedings of Interspeech",
  volume="2019",
  number="9",
  pages="346--350",
  publisher="International Speech Communication Association",
  address="Graz",
  doi="10.21437/Interspeech.2019-2813",
  issn="1990-9772",
  url="https://www.isca-speech.org/archive/Interspeech_2019/pdfs/2813.pdf"
}

Documents