mel frequency cepstral coefficient

Search results for: mel frequency cepstral coefficient

Number of results: 644186

Unsupervised Classification of Hydrophone Signals With an Improved Mel-Frequency Cepstral Coefficient Based on Measured Data Analysis

Journal: IEEE Access, 2019

Modified Mel-frequency Cepstrum Coefficient

2003

Li Tan Montri Karnjanadecha

This paper describes the principle of MFCC feature extraction and the human auditory masking effect in order to introduce a modified MFCC feature extraction method that can improve the robustness of speech recognition systems.

Improving the Robustness of LPCC Feature Against Impulsive Noise by Applying the FOP Method

2006

Pei Ding

The performance of an automatic speech recognition (ASR) system tends to degrade dramatically in the presence of impulsive noise. In previous work [1], we proposed flooring the observation probability (FOP) to compensate for the adverse effect of impulsive noise on sensitive dimensions of Mel-frequency cepstral coefficient (MFCC) features. Linear prediction cepstral coefficient (LPCC) is anothe...

Performance Comparison of Neural Networks and GMM for Vocal/Nonvocal segmentation for Singer Identification

2014

Ananya Bonjyotsna Manabendra Bhuyan

Vocal and nonvocal segmentation is an important task in singing voice signal processing. Before identifying the singer, it is necessary to locate the singer’s voice in a song. Most songs start with a piece of instrumental accompaniment, known in musical terms as a ‘prelude’, after which the singing voice comes into play. Therefore, it is necessary to detect the vocal region in the song in ...

Pitch Mean Based Frequency Warping

2006

Jian Liu Thomas Fang Zheng Wenhu Wu

In this paper, a novel pitch mean based frequency warping (PMFW) method is proposed to reduce the pitch variability in speech signals at the frontend of speech recognition. The warp factors used in this process are calculated based on the average pitch of a speech segment. Two functions to describe the relations between the frequency warping factor and the pitch mean are defined and compared. W...

Mel-LP Based Generalized Cepstral Analysis for Noisy Speech Recognition Using HMM

2013

Md. Rashedul Islam Firoz Ahmed Najmul Hossain Md. Abdur Rahim

This paper deals with the LP-based mel-generalized cepstrum, which has been used as a front end for Hidden Markov Model (HMM) based speech recognition and which incorporates an equal-loudness power law as well as auditory-like frequency resolution. To utilize the generalized cepstral representation, the model spectrum can be varied continuously from the all-pole spectrum to that represented by the cepstrum ...

Improving the performance of HMM-based very low bit rate speech coding

2003

Takahiro Hoshiya Shinji Sako Heiga Zen Keiichi Tokuda Takashi Masuko Takao Kobayashi Tadashi Kitamura

In this paper, we define an F0 quantization scheme for a very low bit rate speech coder based on HMM (Hidden Markov Model). In the coding system, the encoder carries out phoneme recognition, and transmits phoneme indices, state durations and F0 information to the decoder. In the decoder, phoneme HMMs are concatenated according to the phoneme indices, and a sequence of mel-cepstral coefficient v...

Development of a Real-time Embedded System for Speech Emotion Recognition

2014

Amiya Kumar Samantaray Kamala Kanta Mahapatra

Speech emotion recognition is one of the latest challenges in speech processing and Human Computer Interaction (HCI) in order to address the operational needs in real world applications. Besides human facial expressions, speech has proven to be one of the most promising modalities for automatic human emotion recognition. Speech is a spontaneous medium of perceiving emotions which provides in-de...

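STEGANALISIS SINYAL WICARA BERFORMAT .WAV MENGGUNAKAN KOMBINASI METODE MEL-FREQUENCY CEPSTRAL COEFFICIENT (MFCC) DAN LINEAR DISCRIMINANT ANALYSIS (LDA) [Steganalysis of .WAV-Format Speech Signals Using a Combination of the Mel-Frequency Cepstral Coefficient (MFCC) and Linear Discriminant Analysis (LDA) Methods]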

Journal: TEKTRIKA - Jurnal Penelitian dan Pengembangan Telekomunikasi, Kendali, Komputer, Elektrik, dan Elektronika, 2019

EARLYZER: perceptually motivated robust TFR of speech

1999

J. V. Avadhanulu M. Mathew Thippur V. Sreenivas

Development of a robust and efficient front end is crucial for robust ASR. Proper time and frequency resolution of the TFR of speech, motivated by auditory models, is considered an important factor for robustness. An efficient method of realizing a variable-resolution TFR using the DTFT/Goertzel algorithm is proposed instead of the standard FFT-based approach. It is shown that the new representati...
