Single complex sinusoid and ARHE model based pitch extractors
نویسندگان
چکیده
In this paper we propose two techniques for the estimation of the fundamental frequency of speech signals. The rst technique is based on the Autoregressive Harmonic Excitation (ARHE) speech model. ARHE model consists of an autoregressive process driven simultaneously by white noise and a periodic excitation. The second technique is based on the estimation of a complex sinusoid in white Gaussian noise. It uses the Hilbert transform of the speech signal and the derivative of its phase function over the time. The derivative of the phase information is seen as a simple model of a moving average process driven by noise. The fundamental frequency is obtained by the minimum variance estimator of the model. The proposed methods have comparable performance to previous reported pitch detectors while they maintain their performance under noisy conditions.
منابع مشابه
Effect of Wire Pitch on Capacity of Single Staggered Wire and Tube Heat Exchanger Using Computational Fluid Dynamic Simulation
Single staggered is a design development of normal wire and tube heat exchanger that wires are welded with staggered configuration on two sides. Capacity of wire and tube heat exchanger is the ability of the heat exchanger to release heat. The objective of this study is to analyse the effect of wire pitch (pw) on capacity of single staggered wire and tube heat exchanger. The research...
متن کاملمطالعه درجات اصلی گام موسیقی ایرانی از روی طیف نتهای گام
In this paper we have extracted the notes of Iranian scale from the traditional music played by the great musician Shahnazi on the TAR. Then, by analyzing the spectrum of the notes and by using our special averaging we have found the pitch attributed to the components’ frequency and found the interval between the notes. The results are in comple agreement with Pythagorean scale. Pitch is a su...
متن کاملAuditory-Model Based Methods for Multiple Fundamental Frequency Estimation
This chapter describes fundamental frequency (F0) estimation methods that make use of computational models of the human auditory perception and especially pitch perception. At the present time, the most reliable music transcription system available is the ears and the brain of a trained musician. Compared with any artificial audio processing tool, the analytical ability of human hearing is very...
متن کاملF0 parameterization of glottalized tones for HMM-based vietnamese TTS
A conventional HMM-based TTS system for Hanoi Vietnamese often suffers from the hoarse quality due to the incomplete F0 parameterization of glottalized tones. As estimating F0 in glottalization is rather problematic for usual F0 extractors, we propose a pitch marking algorithm where the pitch marks are propagated from regular regions of speech signal to glottalized one, from which the complete ...
متن کاملUsing Context-based Statistical Models to Promote the Quality of Voice Conversion Systems
This article aims to examine methods of optimizing GMM-based voice conversion systems performance in which GMM method is introduced as the basic method for improvement of voice conversion systems performance. In the current methods, due to using a single conversion function to convert all speech units and subsequent spectral smoothing arising from statistical averaging, we will observe quality ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999