speaker recognition

The IIR Submission to CSLP 2006 Speaker Recognition Evaluation

2006

Kong-Aik Lee Hanwu Sun Rong Tong Bin Ma Minghui Dong Chang Huai You Donglai Zhu Chin-Wei Eugene Koh Lei Wang Tomi Kinnunen Chng Eng Siong Haizhou Li

This paper describes the design and implementation of a practical automatic speaker recognition system for the CSLP speaker recognition evaluation (SRE). The speaker recognition system is built upon four subsystems using speaker information from acoustic spectral features. In addition to the conventional spectral features, a novel temporal discrete cosine transform (TDCT) feature is introduced ...

متن کامل

A Real-Time Voice-Control Model to Access Protected Resources

2015

W. O. Adesanya

The paper presents a model developed for real-time speaker recognition system that can be used to access restricted information or resources using human voice. Speaker recognition involves identification and verification of the speaker. At each stage, the voiceprint is compared with model voices of all speakers in the database. The comparison is a measure of the similarity (score) from which re...

متن کامل

Speaker Change Detection Using Binary Key Modelling with Contextual Information

2017

Jose Patino Héctor Delgado Nicholas W. D. Evans

Speaker change detection can be of benefit to a number of different speech processing tasks such as speaker diarization, recognition and detection. Current solutions rely either on highly localized data or on training with large quantities of background data. While efficient, the former tend to over-segment. While more stable, the latter are less efficient and need adaptation to mis-matching da...

متن کامل

Speaker diarization for multiple distant microphone meetings: mixing acoustic features and inter-channel time differences

2006

José Manuel Pardo Xavier Anguera Miró Chuck Wooters

Speaker diarization for recordings made in meetings consists of identifying the number of participants in each meeting and creating a list of speech time intervals for each participant. In recently published work [7] we presented some experiments using only TDOA values (Time Delay Of Arrival for different channels) applied to this task. We demonstrated that information in those values can be us...

متن کامل

A Survey of Speaker Recognition: Fundamental Theories, Recognition Methods and Opportunities

Journal: :IEEE Access 2021

متن کامل

noise and transmission channel degradation compensation and score normalization using a robust hybrid speaker verification and identification system

Journal: :the modares journal of electrical engineering 2004

mohammad mahdi homayounpour jahanshahe kabudian

a parallel hybrid system of hmm and gmm modeling techniques was implemented and used in a telephony speaker verification and identification system. spectral subtraction and weighted projection measure were used to render this system more robust against additional noise. cepstral mean subtraction method was also applied for the compensation of convolution noise due to transmission channel degrad...

متن کامل

Acoustic hole filling for sparse enrollment data using a cohort universal corpus for speaker recognition.

Journal: :The Journal of the Acoustical Society of America 2012

Jun-Won Suh John H L Hansen

In this study, the problem of sparse enrollment data for in-set versus out-of-set speaker recognition is addressed. The challenge here is that both the training speaker data (5 s) and test material (2~6 s) is of limited test duration. The limited enrollment data result in a sparse acoustic model space for the desired speaker model. The focus of this study is on filling these acoustic holes by h...

متن کامل

Improved I-vector-based Speaker Recognition for Utterances with Speaker Generated Non-speech sounds

Journal: :CoRR 2017

Sri Harsha Dumpala Ashish Panda Sunil Kumar Kopparapu

Conversational speech not only contains several variants of neutral speech but is also prominently interlaced with several speaker generated non-speech sounds such as laughter and breath. A robust speaker recognition system should be capable of recognizing a speaker irrespective of these variations in his speech. An understanding of whether the speaker-specific information represented by these ...

متن کامل

Speaker recognition with temporal cues in acoustic and electric hearing.

Journal: :The Journal of the Acoustical Society of America 2005

Michael Vongphoe Fan-Gang Zeng

Natural spoken language processing includes not only speech recognition but also identification of the speaker's gender, age, emotional, and social status. Our purpose in this study is to evaluate whether temporal cues are sufficient to support both speech and speaker recognition. Ten cochlear-implant and six normal-hearing subjects were presented with vowel tokens spoken by three men, three wo...

متن کامل

Comparison of human and machine-based lip-reading

2009

Sarah Hilder Richard Harvey Barry-John Theobald

We investigate the performance of a machine-based lip-reading system using both shape-only parameters and full shape and appearance parameters. Furthermore, we contrast the performance of a machine-based lip-reading system with human lip-reading ability. We find that the automated system outperforms human lip-readers. Curiously however, for relatively simple tasks there is little improvement in...

متن کامل