نتایج جستجو برای: gaussian mixed model gmm

تعداد نتایج: 2329145  

1999
Mouhamadou Seck Frédéric Bimbot Didier Zugaj Bernard Delyon

We present a technique for the segmention of a sound track into two classes of segments. Each frame of signal is preprocessed by extracting cepstral coefficients and their first order derivatives. For each class, the distribution of the frame parameter vectors is modeled by a Gaussian Mixture Model (GMM). GMM order is selected using two criteria : the Minimum Description Length (MDL) criterion ...

2012
Aidong Deng Xiaodan Zhang Jianeng Tang Li Zhao Kang

Because of containing several model waveforms and transmission speed of each model are various, the source signal of rub-impact acoustic emission (AE) will lead to waveform distortion in propagation process, and it is difficult to achieve exact source location by traditional time difference of arrival algorithm. A chaotic neural network technique was introduced to calculate the location of AE s...

2014
Md Jahangir Alam Patrick Kenny Pierre Ouellet Themos Stafylakis Pierre Dumouchel

Voice activity detection, i.e., discrimination of the speech/nonspeech segments in a speech signal, is an important enabling technology for a variety of speech-based applications including the speaker recognition. In this work we provide a performance evaluation of the following supervised and unsupervised VAD algorithms in the context of text-dependent speaker recognition on the RSR2015 (Robus...

Journal: :IRA-International Journal of Technology & Engineering (ISSN 2455-4480) 2017

2004
William M. Campbell Douglas A. Reynolds Joseph P. Campbell

Discriminatively trained support vector machines have recently been introduced as a novel approach to speaker recognition. Support vector machines (SVMs) have a distinctly different modeling strategy in the speaker recognition problem. The standard Gaussian mixture model (GMM) approach focuses on modeling the probability density of the speaker and the background (a generative approach). In cont...

2013
Ryan Price Sangeeta Biswas Koichi Shinoda

This study combines a Gaussian mixture model support vector machine (GMM-SVM) system with a nonlinear feature transformation, discriminatively trained to extract speaker specific features from MFCCs. Separation of the speaker information component and non-speaker related information in the speech signal is accomplished using a regularized siamese deep network (RSDN). RSDN learns a hidden repres...

Journal: :EURASIP J. Image and Video Processing 2013
Yusuke Kamishima Nakamasa Inoue Koichi Shinoda

In large-scale multimedia event detection, complex target events are extracted from a large set of consumer-generated web videos taken in unconstrained environments. We devised a multimedia event detection method based on Gaussian mixture model (GMM) supervectors and support vector machines. A GMM supervector consists of the parameters of a GMM for the distribution of low-level features extract...

2011
Ladislav Lenc Pavel Král

This paper deals with Automatic Face Recognition (AFR), which means automatic identification of a person from a digital image. Our work focuses on an application for Czech News Agency that will facilitate to identify a person in a large database of photographs. The main goal of this paper is to propose some modifications and improvements of existing face recognition approaches and to evaluate t...

Journal: :CoRR 2014
Nitesh Kumar Chaudhary

An efficient, and intuitive algorithm is presented for the identification of speakers from a long dataset (like YouTube long discussion, Cocktail party recorded audio or video).The goal of automatic speaker identification is to identify the number of different speakers and prepare a model for that speaker by extraction, characterization and speaker-specific information contained in the speech s...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید