mel frequency cel cepstrum mfcc

نتایج جستجو برای: mel frequency cel cepstrum mfcc

تعداد نتایج: 490625 فیلتر نتایج به سال:

Discriminative weight training-based optimally weighted MFCC for gender identification

Journal: :IEICE Electronic Express 2009

Sang-Ick Kang Joon-Hyuk Chang

In this paper, we apply a discriminative weight training to a support vector machine (SVM) based gender identification. In our approach, the gender decision rule is derived by the SVM incorporating the optimally weighted mel-frequency cepstral coefficient (MFCC) based on a minimum classification error (MCE) method which is different from the previous works in that optimal weights are differentl...

متن کامل

Analysis of features from analytic representation of speech using MP-ABX measures

2015

Raghavendra Reddy Pappagari Karthika Vijayan K. Sri Rama Murty

The significance of features derived from complex analytic domain representation of speech, for different applications, is investigated. Frequency domain linear prediction (FDLP) coefficients are derived from analytic magnitude and instantaneous frequency (IF) coefficients are derived from analytic phase of speech signals. Minimal pair ABX (MP-ABX) tasks are used to analyse different features a...

متن کامل

Multi-speaker Recognition in Cocktail Party Problem

Journal: :CoRR 2017

Yiqian Wang Wensheng Sun

This paper proposes an original statistical decision theory to accomplish a multi-speaker recognition task in cocktail party problem. This theory relies on an assumption that the varied frequencies of speakers obey Gaussian distribution and the relationship of their voiceprints can be represented by Euclidean distance vectors. This paper uses Mel-Frequency Cepstral Coefficients to extract the f...

متن کامل

Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition

Journal: :IEEE Trans. Speech and Audio Processing 1999

Rivarol Vergin Douglas D. O'Shaughnessy Azarshid Farhat

The focus of a continuous speech recognition process is to match an input signal with a set of words or sentences according to some optimality criteria. The first step of this process is parameterization, whose major task is data reduction by converting the input signal into parameters while preserving virtually all of the speech signal information dealing with the text message. This contributi...

متن کامل

Exploring subsegmental and suprasegmental features for a text-dependent speaker verification in distant speech signals

2010

B. Avinash S. Guruprasad Bayya Yegnanarayana

Existing automatic speaker verification (ASV) systems perform with high accuracy when the speech signal is collected close to the mouth of the speaker (< 1 ft). However, the performance of these systems reduces significantly when speech signals are collected at a distance from the speaker (2-6 ft). The objective of this paper is to address some issues in the processing of speech signals collect...

متن کامل

Language and Text-Independent Speaker Identification System Using GMM

2013

S. SELVA NIDHYANANTHAN SELVA KUMARI

This paper motivates the use of Dynamic Mel-Frequency Cepstral Coefficient (DMFCC) feature and combination of DMFCC and MFCC features for robust language and text-independent speaker identification. MFCC feature, modeled on the human auditory system has been the widely used feature for speaker recognition because of its less vulnerability to noise perturbation and little session variability. Bu...

متن کامل

Auditory Teager energy cepstrum coefficients for robust speech recognition

2005

Dimitrios Dimitriadis Petros Maragos Alexandros Potamianos

In this paper, a feature extraction algorithm for robust speech recognition is introduced. The feature extraction algorithm is motivated by the human auditory processing and the nonlinear Teager-Kaiser energy operator that estimates the true energy of the source of a resonance. The proposed features are labeled as Teager Energy Cepstrum Coefficients (TECCs). TECCs are computed by first filterin...

متن کامل

Detection of asphyxia in infants using deep learning Convolutional Neural Network (CNN) trained on Mel Frequency Cepstrum Coefficient (MFCC) features extracted from cry sounds

Journal: :Journal of Fundamental and Applied Sciences 2018

متن کامل

The Implementation of Speech Recognition using Mel-Frequency Cepstrum Coefficients (MFCC) and Support Vector Machine (SVM) method based on Python to Control Robot Arm

Journal: :IOP Conference Series: Materials Science and Engineering 2018

متن کامل

Music-Inspired Texture Representation

2012

Ben Horsburgh Susan Craw Stewart Massie

Techniques for music recommendation are increasingly relying on hybrid representations to retrieve new and exciting music. A key component of these representations is musical content, with texture being the most widely used feature. Current techniques for representing texture however are inspired by speech, not music, therefore music representations are not capturing the correct nature of music...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید