Publication detail

Search Engine for Information Retrieval from Speech Records

FAPŠO, M.; SCHWARZ, P.; SZŐKE, I.; SMRŽ, P.; SCHWARZ, M.; ČERNOCKÝ, J.; KARAFIÁT, M.; BURGET, L.

Original Title

Search Engine for Information Retrieval from Speech Records

Type

conference paper

Language

English

Original Abstract

This paper describes a designed and implemented system for efficient storage, indexing and search in collections of spoken documents that takes advantage of automatic speech recognition. As the quality of current speech recognizers is not sufficient for many applications, it is necessary to index the ambiguous output of the recognition, i.e. the acyclic graphs of word hypotheses (recognition lattices). The standard methods known from text-based systems therefore cannot be applied directly. The paper discusses an optimized indexing system, developed by our group, for efficient search in this large and complex data structure. The search engine works as a server. The meeting browser JFerret, developed within the European AMI project, is used as a client to browse search results.

Keywords

multimedia information retrieval, speech databases

Authors

FAPŠO, M.; SCHWARZ, P.; SZŐKE, I.; SMRŽ, P.; SCHWARZ, M.; ČERNOCKÝ, J.; KARAFIÁT, M.; BURGET, L.

RIV year

2006

Released

16. 2. 2006

Location

Bratislava

Pages from

100

Pages to

101

Pages count

2

BibTeX

@inproceedings{BUT22170,
  author="Michal {Fapšo} and Petr {Schwarz} and Igor {Szőke} and Pavel {Smrž} and Milan {Schwarz} and Jan {Černocký} and Martin {Karafiát} and Lukáš {Burget}",
  title="Search Engine for Information Retrieval from Speech Records",
  booktitle="Proceedings of the Third International Seminar on Computer Treatment of Slavic and East European Languages",
  year="2006",
  pages="100--101",
  address="Bratislava"
}