نتایج جستجو برای: gmm model

تعداد نتایج: 2106730  

2010
Yamato Ohtani

Voice conversion (VC) is a technique for converting a source speaker’s voice into another speaker’s voice without changing linguistic information. As a typical approach to VC, a statistical method based on Gaussian mixture model (GMM) is used widely. A GMM is trained as a conversion model using a parallel data set composed of many utterance-pairs of source and target speakers. Although this fra...

2002
Matthew N. Stuttle Mark J. F. Gales

Fitting a Gaussian mixture model (GMM) to the smoothed speech spectrum allows an alternative set of features to be extracted from the speech signal. These features have been shown to possess information complementary to the standard MFCC parameterisation. This paper further investigates the use of these GMM features in combination with MFCCs. The extraction and use of a confidence metric to com...

2010
Ming Li Chi-Sang Jung Kyu Jeong Han

This paper presents a novel automatic speaker age and gender identification approach which combines five different methods at the acoustic level to improve the baseline performance. The five subsystems are (1) Gaussian mixture model (GMM) system based on mel-frequency cepstral coefficient (MFCC) features, (2) Support vector machine (SVM) based on GMM mean supervectors, (3) SVM based on GMM maxi...

2014
Min Liu Gregory R. Hancock

Growth mixture modeling has gained much attention in applied and methodological social science research recently, but the selection of the number of latent classes for such models remains a challenging issue, especially when the assumption of proper model specification is violated. The current simulation study compared the performance of a linear growth mixture model (GMM) for determining the c...

2012
Tomi Kinnunen Rahim Saeidi Jussi Leppänen Jukka Saarinen

The problem of context recognition from mobile audio data is considered. We consider ten different audio contexts (such as car, bus, office and outdoors) prevalent in daily life situations. We choose mel-frequency cepstral coefficient (MFCC) parametrization and present an extensive comparison of six different classifiers: knearest neighbor (kNN), vector quantization (VQ), Gaussian mixture model...

Journal: :CoRR 2015
Xin Yuan Hong Jiang Gang Huang Paul A. Wilford

We develop a new compressive sensing (CS) inversion algorithm by utilizing the Gaussian mixture model (GMM). While the compressive sensing is performed globally on the entire image as implemented in our lensless camera, a lowrank GMM is imposed on the local image patches. This lowrank GMM is derived via eigenvalue thresholding of the GMM trained on the projection of the measurement data, thus l...

2004
Patrick Gagliardini Fabio Trojani Giovanni Urga

We propose a class of new robust Generalized Method of Moments (GMM) tests for endogenous structural breaks. The tests are based on supremum, average and exponential functionals derived from robust GMM estimators with bounded influence function. We study the theoretical local robustness properties of the new tests and show that they imply a uniformly bounded asymptotic sensitivity of size and p...

2011
Avi Matza

The current paper proposes skew Gaussian mixture models for speaker recognition and an associated algorithm for its training from experimental data. Speaker identification experiments were conducted, in which speakers were modeled using the familiar Gaussian mixture models (GMM), and the new skewGMM. Each model type was evaluated using two sets of feature vectors, the mel-frequency cepstral coe...

2006
Rongqing Huang

Automatic dialect classification has gained interests in the field of speech research because it is important to characterize speaker traits and to estimate knowledge that could improve integrated speech technology (e.g., speech recognition, speaker recognition). This study addresses novel advances in unsupervised spontaneous Latin American Spanish dialect classification. The problem considers ...

2004
Tomoki Toda Alan W. Black Keiichi Tokuda

This paper describes a method for determining the vocal tract spectrum from articulatory movements using a Gaussian Mixture Model (GMM) to synthesize speech with articulatory information. The GMM on joint probability density of articulatory parameters and acoustic spectral parameters is trained using a parallel acousticarticulatory speech database. We evaluate the performance of the GMM-based m...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید