SNR-invariant PLDA modeling for robust speaker verification

نویسندگان

Na Li

Man-Wai Mak

چکیده

In spite of the great success of the i-vector/PLDA framework, speaker verification in noisy environments remains a challenge. To compensate for the variability of i-vectors caused by different levels of background noise, this paper proposes a new framework, namely SNR-invariant PLDA, for robust speaker verification. By assuming that i-vectors extracted from utterances falling within a narrow SNR range share similar SNRspecific information, the paper introduces an SNR factor to the conventional PLDA model. Then, the SNR-related variability and the speaker-related variability embedded in the i-vectors are modeled by the SNR factor and the speaker factor, respectively. Accordingly, an i-vector is represented by a linear combination of three components: speaker, SNR, and channel. During verification, the variability due to SNR and channels are marginalized out when computing the marginal likelihood ratio. Experiments based on NIST 2012 SRE show that SNR-invariant PLDA achieves superior performance when compared with the conventional PLDA and SNR-dependent mixture of PLDA.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Discriminative subspace modeling of SNR and duration variabilities for robust speaker verification

Although i-vectors together with probabilistic LDA (PLDA) have achieved a great success in speaker verification, how to suppress the undesirable effects caused by the variability in utterance length and background noise level is still a challenge. This paper aims to improve the robustness of i-vector based speaker verification systems by compensating for the utterance-length variability and noi...

متن کامل

Noise robust speaker verification via the fusion of SNR-independent and SNR-dependent PLDA

While i-vectors with probabilistic linear discriminant analysis (PLDA) can achieve state-of-the-art performance in speaker verification, the mismatch caused by acoustic noise remains a key factor affecting system performance. In this paper, a fusion system that combines a multi-condition SNR-independent PLDA model and a mixture of SNR-dependent PLDA models is proposed to make speaker verificati...

متن کامل

1 SNR - INVARIANT PLDA 1 Lecture Notes on SNR - Invariant PLDA

This document provides the derivations of the equations in the paper: Na Li and M.W. Mak, “SNR-Invariant PLDA Modeling in Nonparametric Subspace for Robust Speaker Verification”, IEEE/ACM Trans. on Audio Speech and Language Processing, vol. 23, no. 10, pp. 1648-1659, Oct. 2015. Please cite this document as: M.W. Mak, Lecture Notes on SNR-Invariant PLDA, Technical Report and Lecture Note Series,...

متن کامل

SNR-dependent mixture of PLDA for noise robust speaker verification

This paper proposes a mixture of SNR-dependent PLDA models to provide a wider coverage on the i-vector spaces so that the resulting i-vector/PLDA system can handle test utterances with a wide range of SNR. To maximise the coordination among the PLDA models, they are trained simultaneously via an EM algorithm using utterances contaminated with noise at various levels. The contribution of a train...

متن کامل

Domain Mismatch Modeling of Out-Domain i-Vectors for PLDA Speaker Verification

The state-of-the-art i-vector based probabilistic linear discriminant analysis (PLDA) trained on non-target (or outdomain) data significantly affects the speaker verification performance due to the domain mismatch between training and evaluation data. To improve the speaker verification performance, sufficient amount of domain mismatch compensated out-domain data must be used to train the PLDA ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2015

SNR-invariant PLDA modeling for robust speaker verification

نویسندگان

چکیده

منابع مشابه

Discriminative subspace modeling of SNR and duration variabilities for robust speaker verification

Noise robust speaker verification via the fusion of SNR-independent and SNR-dependent PLDA

1 SNR - INVARIANT PLDA 1 Lecture Notes on SNR - Invariant PLDA

SNR-dependent mixture of PLDA for noise robust speaker verification

Domain Mismatch Modeling of Out-Domain i-Vectors for PLDA Speaker Verification

عنوان ژورنال:

اشتراک گذاری