Speech Reconstruction in Post-Laryngectomised Patients by Formant Manipulation and Pitch Profile Generation
نویسندگان
چکیده
rehabilitation of the ability to speak in a natural sounding voice, for patients who suffer larynx and voice box deficiencies, has long been a dream for both patients and researchers working in this field. Removal of, or damage to, the voice box in a surgical operation such as laryngectomy, affects the pitch generation mechanism of the human voice production system. Such patients speech thus becomes hoarse, whisper like and sometimes not easily perceptible. This speech is obviously different to that from normal speakers, and will have lost many of the distinctive characteristics of the original speech. However, these patients typically retain the ability to whisper in a similar way to normal speakers. This paper aims to present an engineering approach to providing laryngectomy patients the capacity to regain their ability to speak with a more natural voice, and as a side effect, to allow them to conveniently use a mobile phone for communications. The method uses auditory information only, allied with analysis, formant insertion and novel methods for spectrum enhancement and formant smoothing within the reconstruction process. In effect, natural sounding speech is obtained from their spoken whisper-speech, without recourse to surgery. The method builds upon our previously published works using an analysis-by-synthesis approach for voice reconstruction with a modified CELP codec.
منابع مشابه
Statistical Variation Analysis of Formant and Pitch Frequencies in Anger and Happiness Emotional Sentences in Farsi Language
Setup of an emotion recognition or emotional speech recognition system is directly related to how emotion changes the speech features. In this research, the influence of emotion on the anger and happiness was evaluated and the results were compared with the neutral speech. So the pitch frequency and the first three formant frequencies were used. The experimental results showed that there are lo...
متن کاملReconstruction of Dysphonic Speech by MELP
The chronical dysphony is the result of neural, structural or pathological effects on the vocal cords or larynx and it causes undesirable changes in the quality of speech. This paper presents a Mixed Excitation Linear Prediction (MELP) based system that reconstructs normally phonated speech from dysphonic speech, while preserving the individuality of the patient. The proposed system can be used...
متن کاملThe Study of Vowel Space and Formant Structure in Mazani Language
Objective: One of the parameters showing the correct phonetic and phonological development is the correct and clear articulation of vowels is achieved by changing the shape of vocal cords through altering the height and position of the tongue and the movement of the lips and jaw. The tongue’s height and position are the basis of the production and difference of vowels. In other words, the raw s...
متن کاملPsychoacoustical evaluation of the pitch-synchronous overlap-and-add speech-waveform manipulation technique using single-formant stimuli.
This article presents two experiments dealing with a psychoacoustical evaluation of the pitch-synchronous overlap-and-add (PSOLA) technique. This technique has been developed for modification of duration and fundamental frequency of speech and is based on simple waveform manipulations. Both experiments were aimed at deriving the sensitivity of the auditory system to the basic distortions introd...
متن کاملA real-time variable-q non-stationary Gabor transform for pitch shifting
This paper proposes a real-time variable-Q non-stationary Gabor transform (VQ-NSGT) system for speech pitch shifting. The system allows for time-frequency representations of speech on variable-Q (VQ) with perfect reconstruction and computational efficiency. The proposed VQ-NSGT phase vocoder can be used for pitch shifting by simple frequency translation (transposing partials along the frequency...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009