mel frequency cel cepstrum mfcc

نتایج جستجو برای: mel frequency cel cepstrum mfcc

تعداد نتایج: 490625 فیلتر نتایج به سال:

Analysis of the root-cepstrum for acoustic modeling and fast decoding in speech recognition

2001

Ruhi Sarikaya John H. L. Hansen

Root-cepstral analysis has been proposed previously for speech recognition in car environments [9]. In this paper, we focus on an alternative aspect of Root-cepstrum as it applies to discriminative acoustic modeling and fast speech recognizer decoding. We compare Root-cepstrum to Mel-Frequency cepstrum Coefficients (MFCC) in terms of their noise immunity during modeling and decoding speed. Our ...

متن کامل

Spectral Subband Centroids as Complementary Features for Speaker Authentication

2004

Norman Poh Conrad Sanderson Samy Bengio

Most conventional features used in speaker authentication are based on estimation of spectral envelopes in one way or another, e.g., Mel-scale Filterbank Cepstrum Coefficients (MFCCs), Linear-scale Filterbank Cepstrum Coefficients (LFCCs) and Relative Spectral Perceptual Linear Prediction (RASTA-PLP). In this study, Spectral Subband Centroids (SSCs) are examined. These features are the centroid...

متن کامل

An Evaluation of DTW, AA and ARVM for Fixed-Text Speaker Identification

2007

Atanas Ouzounov

Three different methodologies for automatic speaker identification have been evaluated in the paper, namely the well known Dynamic Time Warping (DTW), the Auto-Regressive Vector Models (ARVM) and an Algebraic Approach (AA). The aim of our study is to examine the effectiveness of these approaches in the fixed-text speaker identification task with short phrases in Bulgarian language collected ove...

متن کامل

Source-filter Based Clustering for Monaural Blind Source Separation

2009

Martin Spiertz Volker Gnann

In monaural blind audio source separation scenarios, a signal mixture is usually separated into more signals than active sources. Therefore it is necessary to group the separated signals to the final source estimations. Traditionally grouping methods are supervised and thus need a learning step on appropriate training data. In contrast, we discuss unsupervised clustering of the separated channe...

متن کامل

Perceptual Significance of Cepstral Distortion Measures in Digital Speech Processing

2011

Antonio Vasilijević Davor Petrinović

Currently, one of the most widely used distance measures in speech and speaker recognition is the Euclidean distance between mel frequency cepstral coefficients (MFCC). MFCCs are based on filter bank algorithm whose filters are equally spaced on a perceptually motivated mel frequency scale. The value of mel cepstral vector, as well as the properties of the corresponding cepstral distance, are d...

متن کامل

Integrating the energy information into MFCC

2000

Fang Zheng Guoliang Zhang

The Mel-Frequency Cepstrum Coefficients (MFCC) is a widely used set of feature used in automatic speech recognition systems introduced in 1980 by Davis and Mermelstein [2]. In this traditional implementation, the 0 coefficient is excluded for the reason it is somewhat unreliable. In this paper, we analyze this term and find that it can be regarded as the generalized frequency band energy (FBE) ...

متن کامل

How Efficient Is Mpeg-7 for General Sound Recognition?

2004

HYOUNG-GOOK KIM JUAN JOSÉ BURRED THOMAS SIKORA

Our challenge is to analyze/classify video sound track content for indexing purposes. To this end we compare the performance of MPEG-7 Audio Spectrum Projection (ASP) features based on several basis decomposition algorithms vs. Mel-scale Frequency Cepstrum Coefficients (MFCC). For basis decomposition in the feature extraction we evaluate three approaches: Principal Component Analysis (PCA), Ind...

متن کامل

Comparing MFCC and MPEG-7 Audio Features for Feature Extraaction, Maximum Likelihood HMM and Entropic Prior HMM for Sports Audio Classification

2004

Ziyou Xiong Regunathan Radhakrishnan Ajay Divakaran Thomas S. Huang

We present a comparison of 6 methods for classification of sports audio. For the feature extraction we have two choices: MPEG-7 audio features and Mel-scale Frequency Cepstrum Coefficients (MFCC). For the classificaiton we also have two choices: Maximum Likelihood Hidden Markov Models (ML-HMM) and Entropic Prior HMM(EP-HMM). EP-HMM, in turn, have two variations: with and without trimming of the...

متن کامل

Automatic segmentation of speakers in broadcast audio material

2004

Hyoung-Gook Kim Thomas Sikora

In this paper, dimension-reduced, decorrelated spectral features for general sound recognition are applied to segment conversational speech of both broadcast news audio and panel discussion television programs. Without a priori information about number of speakers, the audio stream is segmented by a hybrid metric-based and model-based segmentation algorithm. For the measure of the performance w...

متن کامل

Hybrid feature extraction method of MFCC+GFCC helicopter noise based on wavelet decomposition

Journal: :Journal of physics 2023

Abstract Aiming at the issue that recognition accuracy of traditional acoustic signal features is low for helicopter signals with wind noise in near field, a method extracting mixed MFCC+GFCC based on wavelet decomposition proposed. Firstly, three-layer and reconstruction are applied to signals; then, Mel-Frequency Cepstral Coefficients (MFCC) Gammatone-Frequency Cepstrum Coefficient (GFCC) res...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید