نتایج جستجو برای: mel frequency cepstral coefficients mfcc

تعداد نتایج: 584588  

Journal: :IJBM 2010
Md. Sahidullah Sandipan Chakroborty Goutam Saha

Conventional Speaker Identification (SI) systems utilise spectral features like Mel-Frequency Cepstral Coefficients (MFCC) or Perceptual Linear Prediction (PLP) as a frontend module. Line Spectral pairs Frequencies (LSF) are popular alternative representation of Linear Prediction Coefficients (LPC). In this paper, an investigation is carried out to extract LSF from perceptually modified speech....

2000
Kuo-Hwei Yuo Tai-Hwei Hwang Hsiao-Chuan Wang

This paper presents a method that combines the techniques of temporal trajectory filtering and projection measure for robust speaker identification. The proposed robust feature, called Relative Autocorrelation Sequence Mel-scale Frequency Cepstral Coefficients (RAS-MFCC), is derived based on filtering the temporal trajectories of short-time one-sided autocorrelation sequences. This filtering pr...

2013
Trisiladevi C. Nagavi Nagappa U. Bhajantri

Query by Singing/Humming (QBSH) is a Music Information Retrieval (MIR) system with small audio excerpt as query. The rising availability of digital music stipulates effective music retrieval methods. Further, MIR systems support content based searching for music and requires no musical acquaintance. Current work on QBSH focuses mainly on melody features such as pitch, rhythm, note etc., size of...

2004
Nengheng Zheng Pak-Chung Ching Tan Lee

This paper investigates the importance of spectrotemporal characteristics of the source excitation signal for speaker recognition. We propose an effective feature extraction technique for obtaining essential timefrequency information from the linear prediction (LP) residual signal, which are closely related to the glottal excitation of individual speaker. With pitch synchronous analysis, wavele...

2009
Shin’ichi Takeuchi Satoshi Tamura Satoru Hayamizu

This paper investigates the best feature parameter for human action recognition by using Hidden Markov Model (HMM) with triaxial acceleration sensor information. Our target is to recognize six types of basic actions (walk, stay, sit down, stand up, lie down, get up) in the room of daily life. First, as parameters in time domain, acceleration information in three axes and their derivatives are u...

2005
Rangarao Muralishankar Abhijeet Sangwan Douglas D. O'Shaughnessy

In this paper, we continue our investigation of the warped discrete cosine transform cepstrum (WDCTC), which was earlier introduced as a new speech processing feature [1]. Here, we study the statistical properties of the WDCTC and compare them with the mel-frequency cepstral coefficients (MFCC). We report some interesting properties of the WDCTC when compared to the MFCC: its statistical distri...

2013
Nassim Asbai Messaoud Bengherabi Farid Harizi Abderrahmane Amrouche

This paper evaluates the impact of low-level features on speaker verification performance, with an emphasis on the recently proposed MFCC variant based on asymmetric tapers (MFCC asymmetric from now on) standalone as features or followed by PCA as linear projection technique applied before the GMM-UBM back-end classifier in clean and noisy environments. The performances of the MFCC-asymmetric f...

Journal: :CoRR 2012
Trisiladevi C. Nagavi Nagappa U. Bhajantri

Query by Singing/Humming (QBSH) is a Music Information Retrieval (MIR) system with small audio excerpt as query. The rising availability of digital music stipulates effective music retrieval methods. Further, MIR systems support content based searching for music and requires no musical acquaintance. Current work on QBSH focuses mainly on melody features such as pitch, rhythm, note etc., size of...

2006
Hemant A. Patil T. K. Basu

Speaker Recognition (SR) is an economic method of biometrics because of availability of low cost and high power computers. An important question which must be answered for the SR system is how well the system resists the effects of determined mimics such as those based on physiological characteristics especially identical twins or triplets. In this paper, a new data fusion technique (viz., majo...

2015
C. Sunitha E. Chandra

Speaker recognition is one of the most essential tasks in the signal processing which identifies a person from characteristics of voices . In this paper we accomplish speaker recognition using Mel-frequency Cepstral Coefficient (MFCC) with Weighted Vector Quantization algorithm. By using MFCC, the feature extraction process is carried out. It is one of the nonlinear cepstral coefficient functio...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید