Mel Frequency Cel Cepstrum (MFCC)

Search results for: Mel Frequency Cel Cepstrum (MFCC)

Number of results: 490625

An efficient mel-LPC analysis method for speech recognition

1998

Hiroshi Matsumoto Yoshihisa Nakatoh Yoshinori Furuhata

This paper proposes a simple and efficient time domain technique to estimate an all-pole model on a mel-frequency axis (Mel-LPC). This method requires only two-fold computational cost as compared to conventional linear prediction analysis. The recognition performance of mel-cepstral parameters obtained by the Mel-LPC analysis is compared with those of conventional LP mel-cepstra and the melfreque...

Evaluation of mel-LPC cepstrum in a large vocabulary continuous speech recognition

2001

Hiroshi Matsumoto Masanori Moroto

This paper presents a simple and efficient time domain technique to estimate an all-pole model on the mel-frequency scale (Mel-LPC), and compares the recognition performance of Mel-LPC cepstrum with those of both the standard LPC mel-cepstrum and the MFCC through the Japanese dictation system (Julius) with a 20,000-word vocabulary. First, the optimal value of frequency warping factor is examined in ...

Vocal tract normalization as linear transformation of MFCC

2003

Michael Pitz Hermann Ney

We have shown previously that vocal tract normalization (VTN) results in a linear transformation in the cepstral domain. In this paper we show that Mel-frequency warping can equally well be integrated into the framework of VTN as linear transformation on the cepstrum. We show examples of transformation matrices to obtain VTN warped Mel-frequency cepstral coefficients (VTN-MFCC) as linear transf...

Comparison of MPEG-7 basis projection features and MFCC applied to robust speaker recognition

2004

Hyoung-Gook Kim Martin Haller Thomas Sikora

Our purpose is to evaluate the efficiency of MPEG-7 basis projection (BP) features vs. Mel-scale Frequency Cepstrum Coefficients (MFCC) for speaker recognition in noisy environments. The MPEG-7 feature extraction mainly consists of a Normalized Audio Spectrum Envelope (NASE), a basis decomposition algorithm and a spectrum basis projection. Prior to the feature extraction the noise reduction alg...

Hardware Implementation of Speech Recognition Using MFCC and Euclidean Distance

2014

S. R. Ganorkar

This paper suggests a Digital Signal Processor (DSP) based speech recognition system with improved performance in terms of recognition accuracy and computational cost. It gives a comprehensive survey of various approaches to feature extraction, such as Mel filter banks with Mel Frequency Cepstrum Coefficients (MFCC). This paper describes an approach of isolated speech recognition by Digital Signal Process...

Speech Recognition using Artificial Neural Network

2014

Nidhi Srivastava

Humans prefer to interact with each other using speech. Since this is the most natural mode of communication, humans also want to interact with machines using speech only. As a result, automatic speech recognition has gained a lot of popularity. Different approaches to speech recognition exist, such as Hidden Markov Model (HMM), Dynamic Time Warping (DTW), Vector Quantization (VQ), etc. This paper use...

Isolated Telugu Speech Recognition using MFCC and Gamma tone features by Radial Basis Networks in Noisy Environment

2015

Shaik Shafee

In this paper, radial basis neural networks [1][12][17] have been examined for speech recognition using the speech features MFCC (Mel frequency Coefficients) and Gamma tone frequency coefficients for isolated Telugu words in a noisy environment. Speech feature vectors are used to train, validate and test the radial basis neural networks. Experiments conducted in Office environment under the presence of...

Exploiting frequency-scaling invariance properties of the scale transform for automatic speech recognition

2000

S. Umesh Richard C. Rose Sarangarajan Parthasarathy

An experimental study of the application of the scale transform to improve the performance of speaker-independent continuous speech recognition is presented in this paper. Three major results are described. First, a comparison was made between the scale-transform based magnitude cepstrum coefficients (STCC) and mel-scale filter bank cepstrum coefficients (MFCC) on a telephone based connected digit recog...

Identification of Noisy Speech Signals using Bispectrum-based 2D-MFCC and Its Optimization through Genetic Algorithm as a Feature Extraction Subsystem

2012

Benyamin Kusumoputro

Power-spectrum-based Mel-Frequency Cepstrum Coefficients (MFCC) is usually used as a feature extractor in a speaker identification system. This one-dimensional feature extraction subsystem, however, shows low recognition rates for identifying utterance speech signals under harsh noise conditions. In this paper, we have developed a speaker identification system based on Bispectrum data that is m...

Cepstral Features and Text-Dependent Speaker Identification – A Comparative Study

2010

Atanas Ouzounov

In the study, the effectiveness of combinations of cepstral features, channel compensation techniques, and different local distances in the Dynamic Time Warping (DTW) algorithm is experimentally evaluated in the text-dependent speaker identification task. Training and testing were done with noisy telephone speech (short phrases in Bulgarian with a length of about 2 seconds) selected f...
