Detail publikačního výsledku

Cognitive Role of Speech Pauses and Algorithmic Consideration for their Processing

SMÉKAL, Z.; STEJSKAL, V.; ESPOSITO, A.

Originální název

Cognitive Role of Speech Pauses and Algorithmic Consideration for their Processing

Anglický název

Cognitive Role of Speech Pauses and Algorithmic Consideration for their Processing

Druh

Článek recenzovaný mimo WoS a Scopus

Originální abstrakt

This study investigates pausing strategies, focusing the attention on empty speech pauses. A cross-modal analysis (video and audio) of spontaneous narratives produced by male and female children and adults showed that a remarkable amount of empty speech pauses was used to signal new concepts in the speech flow and to segment discourse units such as clauses and paragraphs. Based on these results, an adaptive mathematical model for pause distribution was suggested, that exploits, as pause features, the absence of signal and/or the changes of energy over different acoustic dimensions strongly related to the auditory perception. These considerations inspired the formulation and the implementation of two pause detection procedures that proved to be more effective than the Likelihood Ratio Test (LRT) and Long-Term Spectral Divergence (LTSD) algorithms recently proposed in literature and applied for Voice Activity Detection (VAD).

Anglický abstrakt

This study investigates pausing strategies, focusing the attention on empty speech pauses. A cross-modal analysis (video and audio) of spontaneous narratives produced by male and female children and adults showed that a remarkable amount of empty speech pauses was used to signal new concepts in the speech flow and to segment discourse units such as clauses and paragraphs. Based on these results, an adaptive mathematical model for pause distribution was suggested, that exploits, as pause features, the absence of signal and/or the changes of energy over different acoustic dimensions strongly related to the auditory perception. These considerations inspired the formulation and the implementation of two pause detection procedures that proved to be more effective than the Likelihood Ratio Test (LRT) and Long-Term Spectral Divergence (LTSD) algorithms recently proposed in literature and applied for Voice Activity Detection (VAD).

Klíčová slova

Speech a empty pauses discrimination, adaptive algorithms

Klíčová slova v angličtině

Speech a empty pauses discrimination, adaptive algorithms

Autoři

SMÉKAL, Z.; STEJSKAL, V.; ESPOSITO, A.

Rok RIV

2010

Vydáno

01.06.2008

Nakladatel

World Scientific Publications

Místo

Singapore

ISSN

0218-0014

Periodikum

INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE

Svazek

22

Číslo

5

Stát

Singapurská republika

Strany od

1073

Strany do

1088

Strany počet

16

BibTex

@article{BUT47246,
  author="Zdeněk {Smékal} and Vojtěch {Stejskal} and Anna {Esposito}",
  title="Cognitive Role of Speech Pauses and Algorithmic Consideration for their Processing",
  journal="INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE",
  year="2008",
  volume="22",
  number="5",
  pages="1073--1088",
  issn="0218-0014"
}