نتایج جستجو برای: Mel Frequency Cel Cepstrum (MFCC)

تعداد نتایج: 490625  

1998
Hiroshi Matsumoto Yoshihisa Nakatoh Yoshinori Furuhata

This paper proposes a simple and e cient time domain technique to estimate an all-poll model on a mel-frequency axis (Mel-LPC). This method requires only two-fold computational cost as compared to conventional linear prediction analysis. The recognition performance of mel-cepstral parameters obtained by the Mel LPC analysis is compared with those of conventional LP mel-cepstra and the melfreque...

2001
Hiroshi Matsumoto Masanori Moroto

This paper presents a simple and e cient time domain technique to estimate an all-pole model on the melfrequency scale (Mel-LPC), and compares the recognition performance of Mel-LPC cepstrum with those of both the standard LPC mel-cepstrum and the MFCC through the Japanese dictation system (Julius) with 20,000 word vocabulary. First, the optimal value of frequency warping factor is examined in ...

2003
Michael Pitz Hermann Ney

We have shown previously that vocal tract normalization (VTN) results in a linear transformation in the cepstral domain. In this paper we show that Mel-frequency warping can equally well be integrated into the framework of VTN as linear transformation on the cepstrum. We show examples of transformation matrices to obtain VTN warped Mel-frequency cepstral coefficients (VTN-MFCC) as linear transf...

2004
Hyoung-Gook Kim Martin Haller Thomas Sikora

Our purpose is to evaluate the efficiency of MPEG-7 basis projection (BP) features vs. Mel-scale Frequency Cepstrum Coefficients (MFCC) for speaker recognition in noisy environments. The MPEG-7 feature extraction mainly consists of a Normalized Audio Spectrum Envelope (NASE), a basis decomposition algorithm and a spectrum basis projection. Prior to the feature extraction the noise reduction alg...

2014
S. R. Ganorkar

This paper suggests Digital Signal processor (DSP) based speech recognition system with improved performance in terms of recognition accuracies and computational cost. The comprehensive surrey of various approaches of feature extraction like Mel filter banks with Mel Frequency Cepstrum Coefficients (MFCC). This paper describes an approach of isolated speech recognition by Digital Signal Process...

2014
Nidhi Srivastava

Humans prefer to interact with each other using speech. Since this is the most natural mode of communication, the humans also want to interact with machines using speech only. So, automatic speech recognition has gained a lot of popularity. Different approaches for speech recognition exists like Hidden Markov Model (HMM), Dynamic Time Warping (DTW), Vector Quantization (VQ), etc. This paper use...

2015
Shaik Shafee

In this paper, Radial basis neural networks[1][12][17] have been examined for speech recognition using speech features MFCC (Mel frequency Coefficients) and Gamma tone frequency coefficients for isolated Telugu words in noisy environment. Speech feature vectors are used to train, validate and test the Radial basis neural networks.Experiments conducted in Office environment under the presence of...

2000
S. Umesh Richard C. Rose Sarangarajan Parthasarathy

An experimental study of the application of scale-transform to improve the performance of speaker independent continuous speech recognition, is presented in this paper. Three major results are described. First, a comparison was made between the scale-transform based magnitude cepstrum coeÆcients (STCC) and mel-scale lter bank cepstrum coeÆcients (MFCC) on a telephone based connected digit recog...

2012
BENYAMIN KUSUMOPUTRO

Power-spectrum-based Mel-Frequency Cepstrum Coefficients (MFCC) is usually used as a feature extractor in a speaker identification system. This one-dimensional feature extraction subsystem, however, shows low recognition rates for identifying utterance speech signals under harsh noise conditions. In this paper, we have developed a speaker identification system based on Bispectrum data that is m...

2010
Atanas Ouzounov

In the study, the effectiveness of combinations of cepstral features, channel compensation techniques, and different local distances in the Dynamic Time Warping (DTW) algorithm is experimentally evaluated in the text-dependent speaker identification task. The training and the testing has been done with noisy telephone speech (short phrases in Bulgarian with length of about 2 seconds) selected f...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید