Power Spectral Densit Equalization of Large Sp Concatenative T
نویسندگان
چکیده
This paper proposes a channel equalization algorithm for a large speech database with application in concatenative TTS systems. The convolutional channel distortion is equalized by comparing the power spectral densities (PSDs) of utterances of different recording sessions. Autoregressive linear filters are designed on a corpus level and are used offline to filter the corresponding sentences to compensate for the relative distortions caused by the channel effects. Two experiments are carried out to evaluate the benefit of the channel equalization approach. First, this method is used to reduce the distance of their PSDs between two recording sessions to verify the effectiveness of the method. Secondly, it is applied practically in the TTS system. The whole TTS speech database is processed to reduce the PSDs variance over all sessions. Moreover, a subjective listening test is carried out to obtain human evaluation of the new TTS system. Almost all listeners prefer the synthetic speech generated by the new TTS system. Furthermore, an analysis of variance (ANOVA) on this subjective listening test demonstrates that the channel equalization process has significant effect on increasing the perceived voice-quality consistency of the TTS system.
منابع مشابه
Adaptive UL-Coded Spectrally-Precoded OFDM With Zero-Forcing Equalization Under Flat Fading
Constant-power adaptive transmission technique adopting UL-coded spectrally-precoded orthogonal frequencydivision multiplexing (SP-OFDM) signals with one tap zeroforcing equalization is studied on the flat fading channel. By jointly adapting the precoding order for spectral precoder and the component modulation for OFDM, constant-power ULcoded adaptive SP-OFDM is shown to outperform conventiona...
متن کاملEstimation of Spectral Mismatch for Joint Cost Evaluation in Marathi TTS
Among different methods of speech synthesis, Concatenative Speech Synthesis is widely used due to its naturalness and less signal processing requirement. But concatenative TTS has problems like requirement of large database and resulting spectral mismatch in output speech. In concatenative TTS position of syllable plays very important role while carrying out segmentation. If proper position syl...
متن کاملA mixed-excitation frequency domain model for time-scale pitch-scale modification of speech
This paper presents a time-scale pitch-scale modification technique for concatenative speech synthesis. The method is based on a frequency domain source-filter model, where the source is modeled as a mixed excitation. This model is highly coupled with a compression scheme that result in compact acoustic inventories. When compared to the approach in the Whistler system using no mixed excitation,...
متن کاملکاربرد آنالیز طیفی بیزی در تحلیل سریهای زمانی نورسنجی
The present paper introduces the Bayesian spectral analysis as a powerful and efficient method for spectral analysis of photometric time series. For this purpose, Bayesian spectral analysis has programmed in Matlab software for XZ Dra photometric time series which is non-uniform with large gaps and the power spectrum of this analysis has compared with the power spectrum which obtained from the ...
متن کاملRate Loss Due to Power Equalization in Cellular Communications
For a cellular communication network, the loss in up-link bandwidth-efficiency resulting from power equalization is considered. Here power equalization refers to the operation of adjusting the transmit power of mobile units in such a way that the received power is the same for all units whereas bandwidth-efficiency is measured in terms of sum-rate per cell divided by the total system bandwidth....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002