Speech modeling and processing by low-dimensional dynamic glottal models
Abstract
We discuss the use of low-dimensional physical models of the voice source for speech coding and processing applications. A class of waveform-adaptive dynamic glottal models and the associated parameter tracking procedures are illustrated. The model and analysis procedures are assessed through signal transformations on recorded speech, obtained by fitting the model to the data and then acting on the physically oriented parameters of the voice source. In principle, the proposed class of models provides a tool both for estimating glottal source signals and for encoding the speech signal for transformation purposes. The application of the model to time stretching and to frequency control (pitch shifting) is also illustrated. The experiments show that copy synthesis is perceptually almost indistinguishable from the target, and that time stretching and "pitch extrapolation" effects can be obtained with simple control strategies.
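The analysis/transformation idea in the abstract (encode speech as slowly varying source parameters, then act on those parameters) can be illustrated with a minimal sketch. This is not the dynamic glottal model of the paper: the pulse source below is a toy half-wave rectified sine, and the names `stretch_track` and `synthesize`, the frame length, the sample rate, and the parameter values are all assumptions made for illustration only.

```python
import math

def stretch_track(track, factor):
    """Linearly resample a per-frame parameter track to round(len * factor) frames."""
    n_out = max(2, round(len(track) * factor))
    out = []
    for i in range(n_out):
        pos = i * (len(track) - 1) / (n_out - 1)  # fractional source frame index
        lo = int(pos)
        hi = min(lo + 1, len(track) - 1)
        frac = pos - lo
        out.append((1 - frac) * track[lo] + frac * track[hi])
    return out

def synthesize(f0_track, amp_track, frame_len=80, sample_rate=8000):
    """Toy source (assumption): a phase accumulator driving a half-wave rectified sine."""
    samples = []
    phase = 0.0
    for f0, amp in zip(f0_track, amp_track):
        for _ in range(frame_len):
            phase = (phase + f0 / sample_rate) % 1.0
            samples.append(amp * max(0.0, math.sin(2 * math.pi * phase)))
    return samples

# Hypothetical per-frame parameters standing in for a fitted model.
f0 = [100.0, 110.0, 120.0, 110.0]   # fundamental frequency in Hz
amp = [0.5, 1.0, 1.0, 0.5]          # pulse amplitude

copy = synthesize(f0, amp)                                          # copy synthesis
slow = synthesize(stretch_track(f0, 2.0), stretch_track(amp, 2.0))  # time stretching
high = synthesize([1.5 * f for f in f0], amp)                       # pitch shifting
```

In this sketch, time stretching changes only the number of parameter frames, so the pitch contour keeps its shape while the duration doubles; pitch shifting scales the frequency track and leaves the duration unchanged.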
Similar resources
Voice Processing by Dynamic Glottal Models with Applications to Speech Enhancement
We discuss the use of low-dimensional physical models of the voice source for speech coding and processing applications. A class of waveform-adaptive dynamic glottal models and parameter tracking procedures are illustrated. The model and analysis procedures are assessed by addressing speech encoding and enhancement, achievable by using a state space version of the dynamical model in an Extended ...
Speaker adaptive voice source modeling with applications to speech coding and processing
We discuss the use of low-dimensional physical models of the voice source for speech coding and processing applications. A class of waveform-adaptive dynamic glottal models and parameter identification procedures are illustrated. The model and the identification procedures are assessed by addressing signal transformations on recorded speech, achievable by fitting the model to the data, and then act...
Voiced Speech Synthesis Using Pitch Asynchronous Code Excited Linear Filters for the Glottal Source
This paper proposes a model for natural quality voiced speech synthesis using a code-excited linear all-pole filter for modeling the glottal source signal. Classical glottal signal models are explicit-time functions which inhibit joint source-tract parameter estimation and require pitch synchronous estimation with precise segmentation of open and closed glottis phase. These problems are overcome i...
Effect of different jitter-induced glottal pulse shape changes in periodicity perturbation measures
Jitter has long been used to describe period instability in voiced speech signals. In spite of this long history of measuring and modeling jitter, the ways the different glottal pulse phases are affected by jitter differ across studies. The models have quite dissimilar implications, and their selection has been rather arbitrary in the literature. This paper describes different choices for model...
Advances in Glottal Analysis and its Applications
From artificial voices in GPS to automatic dictation systems, from voice-based identity verification to voice pathology detection, speech processing applications are nowadays omnipresent in our daily life. By offering solutions to companies seeking efficiency enhancement with simultaneous cost saving, the market of speech technology is forecast to be particularly promising in the next ye...
Publication date: 2012