Publication result detail

Towards Lower Error Rates in Phoneme Recognition

SCHWARZ, P., MATĚJKA, P., ČERNOCKÝ, J.

Original Title

Towards Lower Error Rates in Phoneme Recognition

English Title

Towards Lower Error Rates in Phoneme Recognition

Type

Peer-reviewed article not indexed in WoS or Scopus

Original Abstract

We investigate techniques for acoustic modeling in automatic recognition of context-independent phoneme strings from the TIMIT database. The baseline phoneme recognizer is based on TempoRAl Patterns (TRAP). This recognizer is simplified to shorten processing times and reduce computational requirements. More states per phoneme and bi-gram language models are incorporated into the system and evaluated. The question of insufficient amount of training data is discussed and the system is improved. All modifications lead to a faster system with about 23.6% relative improvement over the baseline in phoneme error rate.

English abstract

We investigate techniques for acoustic modeling in automatic recognition of context-independent phoneme strings from the TIMIT database. The baseline phoneme recognizer is based on TempoRAl Patterns (TRAP). This recognizer is simplified to shorten processing times and reduce computational requirements. More states per phoneme and bi-gram language models are incorporated into the system and evaluated. The question of insufficient amount of training data is discussed and the system is improved. All modifications lead to a faster system with about 23.6% relative improvement over the baseline in phoneme error rate.

Keywords

phoneme recognition, traps, speech recognition, feature extraction

Key words in English

phoneme recognition, traps, speech recognition, feature extraction

Authors

SCHWARZ, P., MATĚJKA, P., ČERNOCKÝ, J.

Released

08.09.2004

Publisher

Springer

Location

Spolková republika Německo

Book

Lecture Notes in Computer Science

ISBN

0302-9743

Periodical

Lecture Notes in Computer Science

Volume

2004

Number

3206

State

Federal Republic of Germany

Pages from

465

Pages count

8

URL

BibTex

@article{BUT45379,
  author="Petr {Schwarz} and Pavel {Matějka} and Jan {Černocký}",
  title="Towards Lower Error Rates in Phoneme Recognition",
  journal="Lecture Notes in Computer Science",
  year="2004",
  volume="2004",
  number="3206",
  pages="8",
  issn="0302-9743",
  url="http://www.springerlink.com/index/KBY35VBXY16WHV56"
}