gmm model

Doctoral Thesis Techniques for Improving Voice Conversion Based on Eigenvoices

2010

Yamato Ohtani

Voice conversion (VC) is a technique for converting a source speaker’s voice into another speaker’s voice without changing linguistic information. As a typical approach to VC, a statistical method based on Gaussian mixture model (GMM) is used widely. A GMM is trained as a conversion model using a parallel data set composed of many utterance-pairs of source and target speakers. Although this fra...

متن کامل

Combining a Gaussian mixture model front end with MFCC parameters

2002

Matthew N. Stuttle Mark J. F. Gales

Fitting a Gaussian mixture model (GMM) to the smoothed speech spectrum allows an alternative set of features to be extracted from the speech signal. These features have been shown to possess information complementary to the standard MFCC parameterisation. This paper further investigates the use of these GMM features in combination with MFCCs. The extraction and use of a confidence metric to com...

متن کامل

Combining five acoustic level modeling methods for automatic speaker age and gender recognition

2010

Ming Li Chi-Sang Jung Kyu Jeong Han

This paper presents a novel automatic speaker age and gender identification approach which combines five different methods at the acoustic level to improve the baseline performance. The five subsystems are (1) Gaussian mixture model (GMM) system based on mel-frequency cepstral coefficient (MFCC) features, (2) Support vector machine (SVM) based on GMM mean supervectors, (3) SVM based on GMM maxi...

متن کامل

Unrestricted Mixture Models for Class Identification in Growth Mixture Modeling

2014

Min Liu Gregory R. Hancock

Growth mixture modeling has gained much attention in applied and methodological social science research recently, but the selection of the number of latent classes for such models remains a challenging issue, especially when the assumption of proper model specification is violated. The current simulation study compared the performance of a linear growth mixture model (GMM) for determining the c...

متن کامل

Audio context recognition in variable mobile environments from short segments using speaker and language recognizers

2012

Tomi Kinnunen Rahim Saeidi Jussi Leppänen Jukka Saarinen

The problem of context recognition from mobile audio data is considered. We consider ten different audio contexts (such as car, bus, office and outdoors) prevalent in daily life situations. We choose mel-frequency cepstral coefficient (MFCC) parametrization and present an extensive comparison of six different classifiers: knearest neighbor (kNN), vector quantization (VQ), Gaussian mixture model...

متن کامل

Compressive Sensing via Low-Rank Gaussian Mixture Models

Journal: :CoRR 2015

Xin Yuan Hong Jiang Gang Huang Paul A. Wilford

We develop a new compressive sensing (CS) inversion algorithm by utilizing the Gaussian mixture model (GMM). While the compressive sensing is performed globally on the entire image as implemented in our lensless camera, a lowrank GMM is imposed on the local image patches. This lowrank GMM is derived via eigenvalue thresholding of the GMM trained on the projection of the measurement data, thus l...

متن کامل

Robust GMM tests for structural breaks

2004

Patrick Gagliardini Fabio Trojani Giovanni Urga

We propose a class of new robust Generalized Method of Moments (GMM) tests for endogenous structural breaks. The tests are based on supremum, average and exponential functionals derived from robust GMM estimators with bounded influence function. We study the theoretical local robustness properties of the new tests and show that they imply a uniformly bounded asymptotic sensitivity of size and p...

متن کامل

Skew Gaussian Mixture Models for Speaker Recognition

2011

Avi Matza

The current paper proposes skew Gaussian mixture models for speaker recognition and an associated algorithm for its training from experimental data. Speaker identification experiments were conducted, in which speakers were modeled using the familiar Gaussian mixture models (GMM), and the new skewGMM. Each model type was evaluated using two sets of feature vectors, the mel-frequency cepstral coe...

متن کامل

Gaussian Mixture Selection and Data Selection for Unsupervised Spanish Dialect Classification

2006

Rongqing Huang

Automatic dialect classification has gained interests in the field of speech research because it is important to characterize speaker traits and to estimate knowledge that could improve integrated speech technology (e.g., speech recognition, speaker recognition). This study addresses novel advances in unsupervised spontaneous Latin American Spanish dialect classification. The problem considers ...

متن کامل

Mapping from articulatory movements to vocal tract spectrum with Gaussian mixture model for articulatory speech synthesis

2004

Tomoki Toda Alan W. Black Keiichi Tokuda

This paper describes a method for determining the vocal tract spectrum from articulatory movements using a Gaussian Mixture Model (GMM) to synthesize speech with articulatory information. The GMM on joint probability density of articulatory parameters and acoustic spectral parameters is trained using a parallel acousticarticulatory speech database. We evaluate the performance of the GMM-based m...

متن کامل