SNR-invariant PLDA modeling for robust speaker verification
نویسندگان
چکیده
In spite of the great success of the i-vector/PLDA framework, speaker verification in noisy environments remains a challenge. To compensate for the variability of i-vectors caused by different levels of background noise, this paper proposes a new framework, namely SNR-invariant PLDA, for robust speaker verification. By assuming that i-vectors extracted from utterances falling within a narrow SNR range share similar SNRspecific information, the paper introduces an SNR factor to the conventional PLDA model. Then, the SNR-related variability and the speaker-related variability embedded in the i-vectors are modeled by the SNR factor and the speaker factor, respectively. Accordingly, an i-vector is represented by a linear combination of three components: speaker, SNR, and channel. During verification, the variability due to SNR and channels are marginalized out when computing the marginal likelihood ratio. Experiments based on NIST 2012 SRE show that SNR-invariant PLDA achieves superior performance when compared with the conventional PLDA and SNR-dependent mixture of PLDA.
منابع مشابه
Discriminative subspace modeling of SNR and duration variabilities for robust speaker verification
Although i-vectors together with probabilistic LDA (PLDA) have achieved a great success in speaker verification, how to suppress the undesirable effects caused by the variability in utterance length and background noise level is still a challenge. This paper aims to improve the robustness of i-vector based speaker verification systems by compensating for the utterance-length variability and noi...
متن کاملNoise robust speaker verification via the fusion of SNR-independent and SNR-dependent PLDA
While i-vectors with probabilistic linear discriminant analysis (PLDA) can achieve state-of-the-art performance in speaker verification, the mismatch caused by acoustic noise remains a key factor affecting system performance. In this paper, a fusion system that combines a multi-condition SNR-independent PLDA model and a mixture of SNR-dependent PLDA models is proposed to make speaker verificati...
متن کامل1 SNR - INVARIANT PLDA 1 Lecture Notes on SNR - Invariant PLDA
This document provides the derivations of the equations in the paper: Na Li and M.W. Mak, “SNR-Invariant PLDA Modeling in Nonparametric Subspace for Robust Speaker Verification”, IEEE/ACM Trans. on Audio Speech and Language Processing, vol. 23, no. 10, pp. 1648-1659, Oct. 2015. Please cite this document as: M.W. Mak, Lecture Notes on SNR-Invariant PLDA, Technical Report and Lecture Note Series,...
متن کاملSNR-dependent mixture of PLDA for noise robust speaker verification
This paper proposes a mixture of SNR-dependent PLDA models to provide a wider coverage on the i-vector spaces so that the resulting i-vector/PLDA system can handle test utterances with a wide range of SNR. To maximise the coordination among the PLDA models, they are trained simultaneously via an EM algorithm using utterances contaminated with noise at various levels. The contribution of a train...
متن کاملDomain Mismatch Modeling of Out-Domain i-Vectors for PLDA Speaker Verification
The state-of-the-art i-vector based probabilistic linear discriminant analysis (PLDA) trained on non-target (or outdomain) data significantly affects the speaker verification performance due to the domain mismatch between training and evaluation data. To improve the speaker verification performance, sufficient amount of domain mismatch compensated out-domain data must be used to train the PLDA ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015