Publication detail

VISUAL FEATURES FOR MULTIMODAL SPEECH RECOGNITION

MOTLÍČEK, P., BURGET, L., ČERNOCKÝ, J.

Original Title

VISUAL FEATURES FOR MULTIMODAL SPEECH RECOGNITION

Type

conference paper

Language

English

Original Abstract

This paper demonstrates the use of visual parameters extracted from video for the automatic recognition of phoneme strings. Encouraged by previous work on "visually clean" data, we investigate the efficiency of these parameters in the non-ideal conditions presented by the meeting audio-visual data used in our experiments.

Keywords

speech recognition, feature extraction, parameterization, visual features, linear transforms, meeting data

Authors

MOTLÍČEK, P., BURGET, L., ČERNOCKÝ, J.

RIV year

2005

Released

16. 5. 2005

Publisher

Faculty of Electrical Engineering and Communication BUT

Location

Brno

ISBN

80-214-2904-6

Book

Radioelektronika 2005

Pages from

187

Pages to

190

Pages count

4

URL

http://www.fit.vutbr.cz/~motlicek/publi/2005/radioel05.pdf
http://wes.feec.vutbr.cz/UREL/

BibTeX

@inproceedings{BUT21499,
  author="Petr {Motlíček} and Lukáš {Burget} and Jan {Černocký}",
  title="VISUAL FEATURES FOR MULTIMODAL SPEECH RECOGNITION",
  booktitle="Radioelektronika 2005",
  year="2005",
  pages="187--190",
  publisher="Faculty of Electrical Engineering and Communication BUT",
  address="Brno",
  isbn="80-214-2904-6",
  url="http://www.fit.vutbr.cz/~motlicek/publi/2005/radioel05.pdf, http://wes.feec.vutbr.cz/UREL/"
}