Search results for: mel frequency cepstral coefficient mfcc
Number of results: 644,930
This paper proposes an unsupervised method for improving automatic speaker segmentation performance by combining evidence from the residual phase (RP) and mel frequency cepstral coefficients (MFCC). The method demonstrates that the speaker-specific information present in the residual phase is complementary to that captured by conventional MFCC. Moreover, this...
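Since most of these results build on MFCC features, a minimal sketch of the standard extraction pipeline (pre-emphasis, framing and windowing, power spectrum, mel filterbank, log compression, DCT) may be useful. All parameter values below (16 kHz sample rate, 25 ms frames, 26 filters, 13 coefficients) are common illustrative defaults, not those of any cited paper, and the input is assumed to be at least one frame long.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sr, fmin=0.0, fmax=None):
    """Triangular filters spaced evenly on the mel scale."""
    fmax = fmax or sr / 2.0
    mel_pts = np.linspace(hz_to_mel(fmin), hz_to_mel(fmax), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fb = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        l, c, r = bins[i - 1], bins[i], bins[i + 1]
        for k in range(l, c):
            fb[i - 1, k] = (k - l) / max(c - l, 1)  # rising edge
        for k in range(c, r):
            fb[i - 1, k] = (r - k) / max(r - c, 1)  # falling edge
    return fb

def mfcc(signal, sr=16000, frame_len=400, hop=160, n_fft=512,
         n_filters=26, n_ceps=13):
    # Pre-emphasis boosts high frequencies
    sig = np.append(signal[0], signal[1:] - 0.97 * signal[:-1])
    # Frame the signal and apply a Hamming window
    n_frames = 1 + max(0, len(sig) - frame_len) // hop
    frames = np.stack([sig[i * hop: i * hop + frame_len]
                       for i in range(n_frames)])
    frames = frames * np.hamming(frame_len)
    # Power spectrum of each (zero-padded) frame
    power = np.abs(np.fft.rfft(frames, n_fft)) ** 2 / n_fft
    # Mel filterbank energies, then log compression
    log_e = np.log(np.maximum(power @ mel_filterbank(n_filters, n_fft, sr).T,
                              1e-10))
    # DCT-II decorrelates the log energies; keep the first n_ceps
    n = np.arange(n_filters)
    dct = np.cos(np.pi * np.outer(np.arange(n_ceps), (2 * n + 1)
                                  / (2 * n_filters)))
    return log_e @ dct.T
```

One second of 16 kHz audio yields 98 frames of 13 coefficients each with these settings; real systems typically append delta and delta-delta coefficients on top of this static vector.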
Combining amplitude and phase-based features for speaker verification with short duration utterances
Due to the increasing use of fusion in speaker recognition systems, one trend of current research activity focuses on new features that capture complementary information to the MFCC (Mel-frequency cepstral coefficients) for improving speaker recognition performance. The goal of this work is to combine (or fuse) amplitude and phase-based features to improve speaker verification performance. Base...
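The score-level fusion this abstract alludes to is often a weighted linear combination of normalized subsystem scores. Below is a minimal sketch under that assumption; the function name, the z-normalization choice, and the weight `w` are all hypothetical, and `w` would normally be tuned on a development set.

```python
import numpy as np

def fuse_scores(amp_scores, phase_scores, w=0.6):
    """Weighted linear fusion of per-trial scores from two subsystems.

    Scores are z-normalized per subsystem first so the weight is not
    skewed by differing score scales. `w` weights the amplitude-based
    subsystem; (1 - w) weights the phase-based one.
    """
    def znorm(s):
        s = np.asarray(s, dtype=float)
        return (s - s.mean()) / (s.std() + 1e-12)
    return w * znorm(amp_scores) + (1.0 - w) * znorm(phase_scores)
```

The fused scores are then thresholded for the accept/reject decision exactly as a single subsystem's scores would be.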
Automatic detection of emotions has been evaluated using standard Mel-frequency cepstral coefficients (MFCCs) and a variant, MFCC-low, calculated between 20 and 300 Hz in order to model pitch. Plain pitch features have also been used. All of these acoustic features have been modeled by Gaussian mixture models (GMMs) at the frame level. The method has been tested on two different corpora and langu...
This work proposes a novel method of predicting formant frequencies from a stream of mel-frequency cepstral coefficients (MFCC) feature vectors. Prediction is based on modelling the joint density of MFCCs and formant frequencies using a Gaussian mixture model (GMM). Using this GMM and an input MFCC vector, two maximum a posteriori (MAP) prediction methods are developed. The first method predict...
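For a joint GMM over stacked vectors z = [x; y], a simple MAP-flavored predictor picks the mixture component with the highest posterior given x and returns that component's Gaussian conditional mean. The sketch below illustrates this idea under that assumption; it is not the paper's exact method, and the GMM parameters are assumed to be already trained.

```python
import numpy as np

def gmm_map_predict(x, weights, means, covs, dx):
    """Predict y from x under a joint GMM over z = [x; y].

    Each component k has mean [mu_x; mu_y] and full covariance
    [[Sxx, Sxy], [Syx, Syy]]. Select the component maximizing the
    posterior p(k | x), then return its conditional mean
    mu_y + Syx Sxx^{-1} (x - mu_x).
    """
    log_post = []
    for w, mu, S in zip(weights, means, covs):
        mu_x, Sxx = mu[:dx], S[:dx, :dx]
        diff = x - mu_x
        # Unnormalized log posterior of the component given x
        log_post.append(np.log(w) - 0.5 * np.linalg.slogdet(Sxx)[1]
                        - 0.5 * diff @ np.linalg.solve(Sxx, diff))
    k = int(np.argmax(log_post))
    mu, S = means[k], covs[k]
    mu_x, mu_y = mu[:dx], mu[dx:]
    Sxx, Syx = S[:dx, :dx], S[dx:, :dx]
    return mu_y + Syx @ np.linalg.solve(Sxx, x - mu_x)
```

A softer alternative (closer to an MMSE estimate) would average the conditional means of all components weighted by their posteriors rather than committing to the single best one.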
Speech emotion recognition is one of the latest challenges in speech processing and Human Computer Interaction (HCI) in order to address the operational needs in real world applications. Besides human facial expressions, speech has proven to be one of the most promising modalities for automatic human emotion recognition. Speech is a spontaneous medium of perceiving emotions which provides in-de...
This paper presents a novel automatic speaker age and gender identification approach which combines five different methods at the acoustic level to improve the baseline performance. The five subsystems are (1) Gaussian mixture model (GMM) system based on mel-frequency cepstral coefficient (MFCC) features, (2) Support vector machine (SVM) based on GMM mean supervectors, (3) SVM based on GMM maxi...
This paper describes a speaker identification system that uses complementary acoustic features derived from the vocal source excitation and the vocal tract system. Conventional speaker recognition systems typically adopt the cepstral coefficients, e.g., Mel-frequency cepstral coefficients (MFCC) and linear predictive cepstral coefficients (LPCC), as the representative features. The cepstral fea...
This paper presents an evaluation of the RWTH large vocabulary speech recognition system on the Aurora 4 noisy Wall Street Journal database. First, the influence of different root functions replacing the logarithm in the feature extraction is studied. Then quantile based histogram equalization is applied, a parametric method to increase the noise robustness by reducing the mismatch between the ...
We investigated a robust speech feature extraction method using kernel PCA (Principal Component Analysis) for distorted speech recognition. Kernel PCA has been suggested for various image processing tasks that require an image model, such as denoising, where a noise-free image is reconstructed from a noisy input image [1]. Much research on robust speech feature extraction has been done, but it rema...
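The core of kernel PCA is to diagonalize a centered kernel matrix and project data onto the leading eigenvectors in feature space. Below is a minimal RBF-kernel sketch of that projection step only; it does not implement the denoising pre-image computation the abstract mentions, and the `gamma` value is an arbitrary illustrative choice.

```python
import numpy as np

def kernel_pca(X, n_components=2, gamma=0.1):
    """Project the rows of X onto the top kernel principal components.

    Uses an RBF kernel k(a, b) = exp(-gamma * ||a - b||^2), centers
    the kernel matrix in feature space, and normalizes eigenvectors
    so that lambda * <alpha, alpha> = 1.
    """
    sq = np.sum(X ** 2, axis=1)
    K = np.exp(-gamma * (sq[:, None] + sq[None, :] - 2 * X @ X.T))
    n = len(X)
    one = np.ones((n, n)) / n
    Kc = K - one @ K - K @ one + one @ K @ one  # center in feature space
    vals, vecs = np.linalg.eigh(Kc)             # ascending eigenvalues
    idx = np.argsort(vals)[::-1][:n_components]
    alphas = vecs[:, idx] / np.sqrt(np.maximum(vals[idx], 1e-12))
    return Kc @ alphas  # projections of the training points
```

Denoising additionally requires solving the pre-image problem, i.e., mapping the projected feature-space point back to the input space, which has no closed form for the RBF kernel and is usually done iteratively.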