Publication details

Utilizing VOiCES dataset for multichannel speaker verification with beamforming

MOŠNER, L.; PLCHOT, O.; ROHDIN, J.; ČERNOCKÝ, J.

Type

Paper in proceedings not indexed in WoS or Scopus

Abstract

The VOiCES from a Distance Challenge 2019 aimed at the evaluation of speaker verification (SV) systems using single-channel trials based on the Voices Obscured in Complex Environmental Settings (VOiCES) corpus. Since the corpus comprises recordings of the same utterances captured simultaneously by multiple microphones in the same environments, it is also suitable for multichannel experiments. In this work, we design a multichannel dataset as well as development and evaluation trials for SV inspired by the VOiCES challenge. Alternatives discarding harmful microphones are presented as well. We assess the utilization of the created dataset for x-vector based SV with beamforming as a front end. Standard fixed beamforming and NN-supported beamforming using simulated data and ideal binary masks (IBM) are compared with another variant of NN-supported beamforming that is trained solely on the VOiCES data. Lack of data revealed by experiments with the VOiCES-data-trained beamformer was tackled by means of a variant of SpecAugment applied to magnitude spectra. This approach led to as much as 10% relative improvement in EER, pushing results closer to those obtained by a good beamformer based on IBMs.

Keywords

multichannel speaker verification, application-aware beamforming

RIV year

2021

Published

1 November 2020

Publisher

International Speech Communication Association

Place

Tokyo

Book

Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop

ISSN

2312-2846

Periodical

Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland

Volume

2020

Number

11

Country

Finland

First page

187

Last page

193

Number of pages

7

URL

BibTeX

@inproceedings{BUT164069,
  author="Ladislav {Mošner} and Oldřich {Plchot} and Johan Andréas {Rohdin} and Jan {Černocký}",
  title="Utilizing VOiCES dataset for multichannel speaker verification with beamforming",
  booktitle="Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop",
  year="2020",
  journal="Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland",
  volume="2020",
  number="11",
  pages="187--193",
  publisher="International Speech Communication Association",
  address="Tokyo",
  doi="10.21437/Odyssey.2020-27",
  issn="2312-2846",
  url="https://www.isca-speech.org/archive/Odyssey_2020/abstracts/80.html"
}
