Publication result detail

The Kaldi Speech Recognition Toolkit

POVEY, D.; GHOSHAL, A.; BOULIANNE, G.; BURGET, L.; GLEMBEK, O.; GOEL, N.; HANNEMANN, M.; MOTLÍČEK, P.; QIAN, Y.; SCHWARZ, P.; SILOVSKÝ, J.; STEMMER, G.; VESELÝ, K.

Original title

The Kaldi Speech Recognition Toolkit

English title

The Kaldi Speech Recognition Toolkit

Type

Paper in proceedings not indexed in WoS or Scopus

Original abstract

We described the design of Kaldi, a free and open-source speech recognition toolkit. The toolkit currently supports modelling of context-dependent phones of arbitrary context lengths, and all commonly used techniques that can be estimated using maximum likelihood. It also supports the recently proposed SGMMs. Development of Kaldi is continuing and we are working on using large language models in the FST framework, lattice generation and discriminative training.

English abstract

We described the design of Kaldi, a free and open-source speech recognition toolkit. The toolkit currently supports modelling of context-dependent phones of arbitrary context lengths, and all commonly used techniques that can be estimated using maximum likelihood. It also supports the recently proposed SGMMs. Development of Kaldi is continuing and we are working on using large language models in the FST framework, lattice generation and discriminative training.

Keywords

speech recognition, toolkit

Keywords in English

speech recognition, toolkit

Authors

POVEY, D.; GHOSHAL, A.; BOULIANNE, G.; BURGET, L.; GLEMBEK, O.; GOEL, N.; HANNEMANN, M.; MOTLÍČEK, P.; QIAN, Y.; SCHWARZ, P.; SILOVSKÝ, J.; STEMMER, G.; VESELÝ, K.

RIV year

2017

Published

11 December 2011

Publisher

IEEE Signal Processing Society

Place

Hilton Waikoloa Village Resort, Hawaii

ISBN

978-1-4673-0366-8

Book

Proceedings of ASRU 2011

Pages from

1

Pages to

4

Number of pages

4

URL

Full text in the Digital Library

BibTeX

@inproceedings{BUT127200,
  author="Daniel {Povey} and Arnab {Ghoshal} and Gilles {Boulianne} and Lukáš {Burget} and Ondřej {Glembek} and Nagendra {Goel} and Mirko {Hannemann} and Petr {Motlíček} and Yanmin {Qian} and Petr {Schwarz} and Jan {Silovský} and Georg {Stemmer} and Karel {Veselý}",
  title="The Kaldi Speech Recognition Toolkit",
  booktitle="Proceedings of ASRU 2011",
  year="2011",
  pages="1--4",
  publisher="IEEE Signal Processing Society",
  address="Hilton Waikoloa Village Resort, Hawaii",
  isbn="978-1-4673-0366-8",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2011/povey_asru2011_Kaldi%20toolkit.pdf"
}