Project detail

Speech enhancement front-end for robust automatic speech recognition with large amount of training data

Duration: 1.1.2020 — 31.12.2020

Funding resources

Neveřejný sektor - Přímé kontrakty - smluvní výzkum, neveřejné zdroje

On the project

Cílem společného výzkumu je vyvinout technologie parametrizace s obohacováním řeči pro robustní automatické rozpoznávání řeči s velkým objemem trénovacích dat v rámci spolupráce mezi VUT a NTT. Práce je založena na nízkodimenzionálních reprezentacích dat (embeddings) produkovaných neuronovými sítěmi v různých místech řetězce zpracování.

Description in Czech
The purpose of the Joint Research is to develop Speech enhancement front-end for robust automatic speech recognition with large amount of training data through the cooperation of NTT and BUT. The work is relying on embeddings produced by neural networks in various places of the processing chain.

Keywords
speech recognition, robustness, large data, DNN embeddings

Key words in Czech
rozpoznávání řeči, odolnost, velký objem dat,

Default language

English

People responsible

Žmolíková Kateřina, Ing., Ph.D. - principal person responsible

Units

Department of Computer Graphics and Multimedia
- responsible department (10.12.2019 - not assigned)
Speech Data Mining Research Group BUT Speech@FIT
- internal (1.1.2020 - 31.12.2020)
NTT, Inc.
- client (1.1.2020 - 31.12.2020)
Department of Computer Graphics and Multimedia
- beneficiary (1.1.2020 - 31.12.2020)

Responsibility: Žmolíková Kateřina, Ing., Ph.D.

VUT

Faculties and university institutes

Parts

Speech enhancement front-end for robust automatic speech recognition with large amount of training data