Skip to main content

Analysis-by-synthesis method for whisper-speech reconstruction

Ahmadi, Farzaneh, McLoughlin, Ian Vince, Sharifzadeh, Hamid Reza (2008) Analysis-by-synthesis method for whisper-speech reconstruction. In: IEEE Asia Pacific Conference on Circuits and Systems, 2008. APCCAS 2008. . pp. 1280-1283. IEEE ISBN 978-1-4244-2341-5. E-ISBN 978-1-4244-2342-2. (doi:10.1109/APCCAS.2008.4746261) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided)

The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided. (Contact us about this Publication)
Official URL
http://dx.doi.org/10.1109/APCCAS.2008.4746261

Abstract

In the following paper, a method for the real-time conversion of whispers to normal phonated speech through a code excited linear prediction analysis-by-synthesis codec is discussed. This approach uses a template of a speakerpsilas normal phonated speech for extraction of excitation parameters such as pitch and gain, and then injects these estimated excitations into whispered signal to synthesize normal-sounding speech through the CELP codec. Furthermore, since restoring pitch to whispered speech requires some considerations of quality and accuracy, spectral enhancements are required in terms of formant shifting (LSPs modification) and pitch injection based on voiced/unvoiced decision. Spectral shifting is accomplished through line-spectral pair adjustment. Implementing such methods by using the popular CELP codec allows integration of the technique with any modern speech applications and devices. Subjective testing results are presented to determine the effectiveness of the technique.

Item Type: Conference or workshop item (Paper)
DOI/Identification number: 10.1109/APCCAS.2008.4746261
Subjects: T Technology
Divisions: Faculties > Sciences > School of Computing
Depositing User: Ian McLoughlin
Date Deposited: 07 Sep 2015 14:17 UTC
Last Modified: 29 May 2019 14:38 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/48770 (The current URI for this page, for reference purposes)
McLoughlin, Ian Vince: https://orcid.org/0000-0001-7111-2008
  • Depositors only (login required):