نتایج جستجو برای: gmm model

تعداد نتایج: 2106730  

2013
Vishwa Gupta Gilles Boulianne

In this paper we look at real-time computing issues in large vocabulary speech recognition. We use the French broadcast audio transcription task from ETAPE 2011 for this evaluation. We compare word error rate (WER) versus overall computing time for hidden Markov models with Gaussian mixtures (GMM-HMM) and deep neural networks (DNN-HMM). We show that for a similar computing during recognition, t...

Journal: :Digital Signal Processing 2000
Robert B. Dunn Douglas A. Reynolds Thomas F. Quatieri

Two approaches to detecting and tracking speakers in multispeaker audio are described. Both approaches use an adapted Gaussian mixture model, universal background model (GMM-UBM) speaker detection system as the core speaker recognition engine. In one approach, the individual log-likelihood ratio scores, which are produced on a frame-by-frame basis by the GMM-UBM system, are used to first partit...

2002
Peng Ding Yang Liu Bo Xu

In this paper, the statistical method of Factor Analysis(FA) is studied on Gaussian Mixture Model(GMM) based speaker identification(SI) system to model the data covariance which is usually neglected due to the training data sparseness. Because the variance of GMM can represents speaker variability, it is very important in SI systems. By FA modeled the data covariance, a relative gain of 39.6% o...

2011
Jae-Hun Choi Sang-Kyun Kim Joon-Hyuk Chang

In this letter, we present a speech enhancement technique based on the ambient noise classification incorporating the Gaussian mixture model (GMM). The principal parameters of the statistical model-based speech enhancement algorithm such as the weighting parameter in the decision-directed (DD) method and the long-term smoothing parameter of the noise estimation, are chosen as different values a...

Journal: :Speech Communication 2015
Richard D. McClanahan Phillip L. De Leon

The majority of state-of-the-art speaker recognition systems (SR) utilize speaker models that are derived from an adapted universal background model (UBM) in the form of a Gaussian mixture model (GMM). This is true for GMM supervector systems, joint factor analysis systems, and most recently i-vector systems. In all of these systems, the posterior probabilities and sufficient statistics calcula...

1998
James Moody Stefan Slomka Jason W. Pelecanos Sridha Sridharan

This paper studies the reliance of a Gaussian Mixture Model (GMM) based closed-set Speaker Identification system on model convergence and describes methods to improve this convergence. It shows that the reason why the Vector Quantisation GMMs (VQGMMs) outperform a simple GMM is mainly due to decreasing the complexity of the data during training. In addition, it is shown that the VQGMM system is...

Journal: :JIPS 2013
Huynh Trung Manh Gueesang Lee

Object segmentation is a challenging task in image processing and computer vision. In this paper, we present a visual attention based segmentation method to segment small sized interesting objects in natural images. Different from the traditional methods, we first search the region of interest by using our novel saliency-based method, which is mainly based on band-pass filtering, to obtain the ...

2017
Ruchir Travadi Shrikanth S. Narayanan

Total Variability Model Conventionally formulated as generative model over features Assumes that feature vector distribution is a GMM Distribution Free Formulation Follows from asymptotic normality of Baum-Welch statistics Connections to probabilistic PCA, matrix completion Leads to a stochastic subgradient descent based estimation algorithm Conventional Formulation Collection of feature vector...

2012
Toshihiko Yamasaki Tomoaki Matsunami

In this paper, we propose a method to analyze gender of the pedestrian and whether he or she has a baggage or not in a public space. The challenging part of this work is we only use top-view camera images to protect the pedestrians’ privacy. We focused on temporal changes in their position, shape, and contours over the frames because their appearances do not provide much information. We extract...

2010
Shih-Sian Cheng I-Fan Chen Hsin-Min Wang

This paper presents a Bayesian approach for Gaussian mixture model (GMM)-based speaker identification. Some approaches evaluate the speaker score of a test speech utterance using a single data likelihood over the GMM learned by point estimation methods according to the maximum likelihood or maximum a posteriori criteria. In contrast, the Bayesian approach evaluates the score by using the expect...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید