Reconstruction of normal sounding speech for laryngectomy patients through a modified CELP codec

Sharifzadeh, Hamid Reza, McLoughlin, Ian Vince, Ahmadi, Farzaneh (2010) Reconstruction of normal sounding speech for laryngectomy patients through a modified CELP codec. IEEE Transactions on Biomedical Engineering, 57 (10). pp. 2448-2458. ISSN 0018-9294. (doi:10.1109/TBME.2010.2053369) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided) (KAR id:48914)

The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided.
Official URL: http://dx.doi.org/ 10.1109/TBME.2010.2053369

Abstract

Whispered speech can be useful for quiet and private communication, and is the primary means of unaided spoken communication for many people experiencing voice-box deficiencies. Patients who have undergone partial or full laryngectomy are typically unable to speak anything more than hoarse whispers, without the aid of prostheses or specialized speaking techniques. Each of the current prostheses and rehabilitative methods for post-laryngectomized patients (primarily oesophageal speech, tracheo-esophageal puncture, and electrolarynx) have particular disadvantages, prompting new work on nonsurgical, noninvasive alternative solutions. One such solution, described in this paper, combines whisper signal analysis with direct formant insertion and speech modification located outside the vocal tract. This approach allows laryngectomy patients to regain their ability to speak with a more natural voice than alternative methods, by whispering into an external prosthesis, which then, recreates and outputs natural-sounding speech. It relies on the observation that while the pitch-generation mechanism of laryngectomy patients is damaged or unusable, the remaining components of the speech production apparatus may be largely unaffected. This paper presents analysis and reconstruction methods designed for the prosthesis, and demonstrates their ability to obtain natural-sounding speech from the whisper-speech signal using an external analysis-by-synthesis processing framework.

Item Type:	Article
DOI/Identification number:	10.1109/TBME.2010.2053369
Uncontrolled keywords:	Analysis-by-synthesis, Bionic voice, Laryngectomy, Rehabilitation, Speech processing, Whispers
Subjects:	T Technology
Institutional Unit:	Schools > School of Computing
Former Institutional Unit:	Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing
Depositing User:	Ian McLoughlin
Date Deposited:	07 Sep 2015 15:12 UTC
Last Modified:	28 Apr 2026 08:16 UTC
Resource URI:	https://kar.kent.ac.uk/id/eprint/48914 (The current URI for this page, for reference purposes)

University of Kent Author Information

McLoughlin, Ian Vince.

Creator's ORCID:	https://orcid.org/0000-0001-7111-2008
CReDIT Contributor Roles:

Depositors only (login required):

Altmetric

Total Views

Total unique views of this page since July 2020. For more details click on the image.