مدلسازی gmm

نتایج جستجو برای: مدلسازی gmm

تعداد نتایج: 16460 فیلتر نتایج به سال:

Boosting with anti-models for automatic language identification

2007

Xi Yang Man-Hung Siu Herbert Gish Brian Kan-Wing Mak

In this paper, we adopt the boosting framework to improve the performance of acoustic-based Gaussian mixture model (GMM) Language Identification (LID) systems. We introduce a set of low-complexity, boosted target and anti-models that are estimated from training data to improve class separation, and these models are integrated during the LID backend process. This results in a fast estimation pro...

متن کامل

Speaker recognition using phoneme-specific GMMs

2004

Eric G. Hansen Raymond E. Slyh Timothy R. Anderson

This paper compares three approaches to building phoneme-specific Gaussian mixture model (GMM) speaker recognition systems on the NIST 2003 Extended Data Evaluation to a baseline GMM system covering all of the phonemes. The individual performance of any given phoneme-specific GMM system falls below the performance of the baseline GMM, but fusing the top 40 performing scores of the individual ph...

متن کامل

GMM mapping of visual features of cued speech from speech spectral features

2013

Zuheng Ming Denis Beautemps Gang Feng

In this paper, we present a statistical method based on GMM modeling to map the acoustic speech spectral features to visual features of Cued Speech in the regression criterion of Minimum Mean-Square Error (MMSE) in a low signal level which is innovative and different with the classic text-to-visual approach. Two different training methods for GMM, namely Expecting-Maximization (EM) approach and...

متن کامل

Lattice-based training of bottleneck feature extraction neural networks

2013

Matthias Paulik

This paper investigates a method for training bottleneck (BN) features in a more targeted manner for their intended use in GMM-HMM based ASR. Our approach adds a GMM acoustic model activation layer to a standard BN feature extraction (FE) neural network and performs lattice-based MMI training on the resulting network. After training, the network is reverted back into a working BN FE network by ...

متن کامل

Acoustic vector resampling for GMMSVM-based speaker verification

2010

Man-Wai Mak Wei Rao

Using GMM-supervectors as the input to SVM classifiers (namely, GMM-SVM) is one of the promising approaches to text-independent speaker verification. However, one unaddressed issue of this approach is the severe imbalance between the numbers of speaker-class utterances and impostor-class utterances available for training a speaker-dependent SVM. This paper proposes a resampling technique – name...

متن کامل

Memory-Based Approximation of the Gaussian Mixture Model Framework for Bandwidth Extension of Narrowband Speech

2011

Amr H. Nour-Eldin Peter Kabal

In this paper, we extend our previous work on exploiting speech temporal properties to improve Bandwidth Extension (BWE) of narrowband speech using Gaussian Mixture Models (GMMs). By quantifying temporal properties through information theoretic measures and using delta features, we have shown that narrowband memory significantly increases certainty about highband parameters. However, as delta f...

متن کامل

High-level feature weighted GMM network for audio stream classification

2004

Rongqing Huang John H. L. Hansen

The problem of unsupervised audio classification continuous to be a challenging research problem which significantly impacts ASR and Spoken Document Retrieval (SDR) performance. This paper addresses novel advances in audio classification for speech recognition. A new algorithm is proposed for audio classification, which is based on Weighted GMM Network (WGN). Two new high-level features: VSF (V...

متن کامل

Bias correction methods for dynamic panel data models with fixed effects

2017

Mohamed Reda Abonazel

This paper considers the estimation methods for dynamic panel data (DPD) models with fixed effects, which suggested in econometric literature, such as least squares (LS) and generalized method of moments (GMM). These methods obtain biased estimators for DPD models. The LS estimator is inconsistent when the time dimension (T) is short regardless of the cross-sectional dimension (N). Although con...

متن کامل

High quality voice conversion based on Gaussian mixture model with dynamic frequency warping

2001

Tomoki Toda Hiroshi Saruwatari Kiyohiro Shikano

In the voice conversion algorithm based on the Gaussian Mixture Model (GMM), quality of the converted speech is degraded because the converted spectrum is exceedingly smoothed. In this paper, we newly propose the GMM-based algorithm with the Dynamic Frequency Warping (DFW) to avoid the over-smoothing. We also propose that the converted spectrum is calculated by mixing the GMM-based converted sp...

متن کامل

Feature-based and channel-based analyses of intrinsic variability in speaker verification

2009

Martin Graciarena Tobias Bocklet Elizabeth Shriberg Andreas Stolcke Sachin S. Kajarekar

We explore how intrinsic variations (those associated with the speaker rather than the recording environment) affect textindependent speaker verification performance. In a previous paper we introduced the SRI-FRTIV corpus and provided speaker verification results using a Gaussian mixture model (GMM) system on telephone-channel speech. In this paper we explore the use of other speaker verificati...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید