نتایج جستجو برای: gaussian mixture model gmm
تعداد نتایج: 2220569 فیلتر نتایج به سال:
Fitting a Gaussian mixture model (GMM) to the smoothed speech spectrum allows an alternative set of features to be extracted from the speech signal. These features have been shown to possess information complementary to the standard MFCC parameterisation. This paper further investigates the use of these GMM features in combination with MFCCs. The extraction and use of a confidence metric to com...
This paper presents a novel automatic speaker age and gender identification approach which combines five different methods at the acoustic level to improve the baseline performance. The five subsystems are (1) Gaussian mixture model (GMM) system based on mel-frequency cepstral coefficient (MFCC) features, (2) Support vector machine (SVM) based on GMM mean supervectors, (3) SVM based on GMM maxi...
We show that a classifier based on Gaussian mixture models (GMM) can be trained discriminatively to improve accuracy. We describe a training procedure based on the extended Baum-Welch algorithm used in speech recognition. We also compare the accuracy and degree of sparsity of the new discriminative GMM classifier with those of generative GMM classifiers, and of kernel classifiers, such as suppo...
We present an efficient object detection and tracking technique using still cameras in low contrast conditions. The tracking algorithm involves background subtraction using Gaussian Mixture Model (GMM). Our method involves updating the parameters of the Mixture Model using a combination of an online k-means approximation technique and the ExpectationMaximization (EM) algorithm. We have shown ex...
This paper describes a new approach to acoustic mod-eling for large vocabulary continuous speech recognition (LVCSR) systems. Each phone is modeled with a large Gaussian mixture model (GMM) whose context-dependent mixture weights are estimated with a sentence-level discrim-inative training criterion. The estimation problem is casted in a neural network framework, which enables the incorporation...
The problem of context recognition from mobile audio data is considered. We consider ten different audio contexts (such as car, bus, office and outdoors) prevalent in daily life situations. We choose mel-frequency cepstral coefficient (MFCC) parametrization and present an extensive comparison of six different classifiers: knearest neighbor (kNN), vector quantization (VQ), Gaussian mixture model...
We describe a Monte Carlo method for model-space noise adaptation of Gaussian mixture models (GMMs). This method combines a single-Gaussian noise model with the GMM speech model to produce an adapted model. It is similar to Parallel Model Combination or model-space Joint, except that it applies to spliced and projected MFCC features rather than to MFCC plus dynamic features. We demonstrate the ...
This paper is concerned with compressive sensing of signals drawn from a Gaussian mixture model (GMM) with sparse precision matrices. Previous work has shown: (i) a signal drawn from a given GMM can be perfectly reconstructed from r noise-free measurements if the (dominant) rank of each covariance matrix is less than r; (ii) a sparse Gaussian graphical model can be efficiently estimated from fu...
In this paper,features for text-independent speaker recognition has been evaluated. Speaker identification from a set of templates and analyzing speaker recognition rate by extracting several key features like Mel Frequency Cepstral Coefficients [MFCC] from the speech signals of those persons by using the process of feature extraction using MATLAB2013 .These features are effectively captured us...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید