R&D Result Detail

Original Title

State-Space Representation of Cepstral Vocal Tract Model for DSP Implementation

Type

Peer-reviewed article not indexed in WoS or Scopus

Original Abstract

One method of speech production in text-to-speech (TTS) synthesis is the parametric method, in which an excitation signal drives a vocal-tract model with time-varying parameters. A new state-space cepstral vocal-tract model is described that approximates both the formants and the antiformants of the model frequency response for voiced and unvoiced speech sounds. It thus differs from the currently used LPC model, which approximates the formants alone. Unlike PSOLA-type methods, this method is convenient for prosody modelling and requires less memory. Cepstral speech synthesis starts from the cepstral coefficients obtained by analysing the speech signal. The paper proposes a new structure for the parametric vocal-tract model, formed by combining IIR and FIR digital filters. The model is optimised for implementation on a fixed-point digital signal processor with Harvard architecture.

Keywords

Text-To-Speech Synthesis, Cepstral Vocal Tract Model

Authors

SMÉKAL, Z., VONDRA, M., VÍCH, R.

Released

03.09.2002

ISSN

1213-161X

Periodical

ElectronicsLetters.com - http://www.electronicsletters.com

Volume

2002

Number

9

Country

Czech Republic

Pages from

1

Pages count

10

BibTeX

@article{BUT40781,
  author="Zdeněk {Smékal} and Martin {Vondra} and Robert {Vích}",
  title="State-Space Representation of Cepstral Vocal Tract Model for DSP Implementation",
  journal="ElectronicsLetters.com - http://www.electronicsletters.com",
  year="2002",
  volume="2002",
  number="9",
  pages="1--10",
  issn="1213-161X"
}
