Přístupnostní navigace
E-přihláška
Vyhledávání Vyhledat Zavřít
Detail projektu
Období řešení: 1.7.2011 — 30.11.2012
Zdroje financování
Evropská unie - Seventh Research Framework Programme
Ministerstvo školství, mládeže a tělovýchovy ČR - Podpora projektů sedmého rámcového programu Evropského společenství pro výzkum, technologický rozvoj a demonstrace (2007 až 2013) podle zákona č. 171/2007 Sb.
Evropská unie - Sedmý rámcový program Evropského společenství pro výzkum, technologický rozvoj a demonstrace
O projektu
The key idea underlying project GLOCAL is to use events as the primary means for organizing and indexing media. Events have a local and a global dimension. The local dimension involves the assignment of tags (conceptualizations) to media (personal experiences). The global dimension involves the sharing of general event structures and specific event descriptions, which enables social sharing and networking of events, tags and media. Within networked communities, common (global) descriptions of the world can be built and continuously enriched by a continuous flow of individual (local) descriptions. One of the project enabling technologies deals with the analysis of multimedia content. The running GLOCAL project focuses mainly on text, image and video analysis. The recent study of user requirements showed that the project would benefit from advanced speech recognition components. The proposed GLOCAL project extension will bring additional functionality in the area of context-aware semantic annotation of multimedia files based on sophisticated audio and video processing. The search on the multimedia files will overcome limits of the current "one-best" voice transcription systems by employing graph-based representation of all probable alternative speech transcripts. A special attention will be paid to the detection of OOV (out-of-vocabulary) words. In contrast to the existing state-of-the-art speech processing systems, the GLOCAL-EEU will also provide sufficient accuracy when searching for new names of persons, places, companies etc. To address this, the project will combine the vocabulary-based speech recognizer with a keyword spotting module implementing the functionality of phonetic search. The addition of BUT to the consortium will help to lessen the impact of the semantic gap by exploiting the local and global (GLOCAL) knowledge about events and related contents to locally index media and experiences and later to globally share them within networked communities. This will improve the end-user experience with the GLOCAL-enhanced technologies and hence make the project more effective and more efficient.
Popis českyThe key idea underlying project GLOCAL is to use events as the primary means for organizing and indexing media. Events have a local and a global dimension. The local dimension involves the assignment of tags (conceptualizations) to media (personal experiences). The global dimension involves the sharing of general event structures and specific event descriptions, which enables social sharing and networking of events, tags and media. Within networked communities, common (global) descriptions of the world can be built and continuously enriched by a continuous flow of individual (local) descriptions. One of the project enabling technologies deals with the analysis of multimedia content. The running GLOCAL project focuses mainly on text, image and video analysis. The recent study of user requirements showed that the project would benefit from advanced speech recognition components. The proposed GLOCAL project extension will bring additional functionality in the area of context-aware semantic annotation of multimedia files based on sophisticated audio and video processing. The search on the multimedia files will overcome limits of the current "one-best" voice transcription systems by employing graph-based representation of all probable alternative speech transcripts. A special attention will be paid to the detection of OOV (out-of-vocabulary) words. In contrast to the existing state-of-the-art speech processing systems, the GLOCAL-EEU will also provide sufficient accuracy when searching for new names of persons, places, companies etc. To address this, the project will combine the vocabulary-based speech recognizer with a keyword spotting module implementing the functionality of phonetic search. The addition of BUT to the consortium will help to lessen the impact of the semantic gap by exploiting the local and global (GLOCAL) knowledge about events and related contents to locally index media and experiences and later to globally share them within networked communities. This will improve the end-user experience with the GLOCAL-enhanced technologies and hence make the project more effective and more efficient.
Klíčová slova event analysis, speech processing
Označení
7E12055
Originální jazyk
angličtina
Řešitelé
Smrž Pavel, doc. RNDr., Ph.D. - hlavní řešitel
Útvary
Ústav počítačové grafiky a multimédií- odpovědné pracoviště (6.6.2011 - nezadáno)Výzkumná skupina dolování dat z řeči BUT Speech@FIT- interní (6.6.2011 - 30.11.2012)Výzkumná skupina počítačové grafiky- interní (6.6.2011 - 30.11.2012)Výzkumná skupina znalostních technologií- interní (6.6.2011 - 30.11.2012)Ústav počítačové grafiky a multimédií- příjemce (6.6.2011 - 30.11.2012)
Výsledky
HRADIŠ, M.; KOLÁŘ, M.; KRÁL, J.; LÁNÍK, A.; ZEMČÍK, P.; SMRŽ, P. Annotating images with suggestions - user study of a tagging system. Advanced Concepts for Intelligent Vision Systems. Lecture Notes in Computer Science. Lecture Notes in Computer Science. Brno: Springer Verlag, 2012. no. 7517, p. 155-166. ISBN: 978-3-642-33139-8. ISSN: 0302-9743.Detail
KRÁL, J.; HRADIŠ, M. Restricted Boltzman Machines for Image Tag Suggestion. Proceedings of the 19th Conference STUDENT EEICT 2012. Brno: Brno University of Technology, 2012. p. 1-5. Detail
Odkaz
http://www.glocal-project.eu/
Odpovědnost: Smrž Pavel, doc. RNDr., Ph.D.