Search results for: gmm model

Number of results: 2106730

2008
Xing Xing

In this project report, we have investigated video modeling techniques and realized a statistical video representation and modeling scheme [1], which could be used for later video retrieval and content extraction tasks. This method utilizes a Gaussian mixture model (GMM) to segment video content into coherent space-time segments within the video frames and across frames. It treats space and ti...

2004
William M. Campbell Douglas A. Reynolds Joseph P. Campbell

Discriminatively trained support vector machines have recently been introduced as a novel approach to speaker recognition. Support vector machines (SVMs) have a distinctly different modeling strategy in the speaker recognition problem. The standard Gaussian mixture model (GMM) approach focuses on modeling the probability density of the speaker and the background (a generative approach). In cont...

2013
Ryan Price Sangeeta Biswas Koichi Shinoda

This study combines a Gaussian mixture model support vector machine (GMM-SVM) system with a nonlinear feature transformation, discriminatively trained to extract speaker specific features from MFCCs. Separation of the speaker information component and non-speaker related information in the speech signal is accomplished using a regularized siamese deep network (RSDN). RSDN learns a hidden repres...

Journal: EURASIP J. Image and Video Processing 2013
Yusuke Kamishima Nakamasa Inoue Koichi Shinoda

In large-scale multimedia event detection, complex target events are extracted from a large set of consumer-generated web videos taken in unconstrained environments. We devised a multimedia event detection method based on Gaussian mixture model (GMM) supervectors and support vector machines. A GMM supervector consists of the parameters of a GMM for the distribution of low-level features extract...

Journal: Journal of rehabilitation research and development 2002
Karen L Perell Scott Gregor Gene Kim Sirintorn Rushatakankovit Erika Scremin Seymour Levin Robert Gregor

We compared recumbent bicycle kinetics in diabetic peripheral neuropathy and nondiabetic men (nine per group). 3D kinematic and force pedal data in a linked-segment model were used. The generalized muscle moment (GMM) patterns were similar between the two groups except for (1) decreased maximum knee flexor moment, (2) increased minimum knee flexor GMM, and (3) maximum hip extensor GMM by the di...

2012
Osman BÜYÜK Mustafa Levent ARSLAN

In this paper, we investigate model selection and channel variability issues on a text-dependent single utterance (TDSU) speaker verification application. Due to the lack of an appropriate database for the task, a multichannel speaker recognition database, which consists of multiple recordings of a single Turkish utterance, is collected. The first set of experiments is devoted to model selectio...

Journal: Computer Speech & Language 2016
Sandesh Aryal Ricardo Gutierrez-Osuna

The conventional approach for data-driven articulatory synthesis consists of modeling the joint acoustic-articulatory distribution with a Gaussian mixture model (GMM), followed by a post-processing step that optimizes the resulting acoustic trajectories. This final step can significantly improve the accuracy of the GMM frame-by-frame mapping but is computationally intensive and requires that th...

2006
José C. Principe John G. Harris John M. Shea

Abstract of Dissertation Presented to the Graduate School of the University of Florida in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy. GAUSSIAN MIXTURE MODEL BASED SYSTEM IDENTIFICATION AND CONTROL. By Jing Lan, August 2006. Chair: José C. Principe. Major Department: Electrical and Computer Engineering. In this dissertation, we present a methodology of combining an improved ...

2013
Matthias Paulik

This paper investigates a method for training bottleneck (BN) features in a more targeted manner for their intended use in GMM-HMM based ASR. Our approach adds a GMM acoustic model activation layer to a standard BN feature extraction (FE) neural network and performs lattice-based MMI training on the resulting network. After training, the network is reverted back into a working BN FE network by ...

2001
Tomoki Toda Hiroshi Saruwatari Kiyohiro Shikano

In the voice conversion algorithm based on the Gaussian Mixture Model (GMM), the quality of the converted speech is degraded because the converted spectrum is excessively smoothed. In this paper, we propose a GMM-based algorithm with Dynamic Frequency Warping (DFW) to avoid the over-smoothing. We also propose that the converted spectrum is calculated by mixing the GMM-based converted sp...
