Speaker adaptation using maximum likelihood model interpolation

نویسندگان

  • Zuoying Wang
  • Feng Liu
چکیده

A speaker adaptation scheme named maximum likelihood model interpolation (MLMI) is proposed. The basic idea of MLMI is to compute the speaker adapted (SA) model of a test speaker by a linear convex combination of a set of speaker dependent (SD) models. Given a set of training speakers, we first calculate the corresponding SD models for each training speaker as well as the speakerindependent (SI) models. Then, the mean vector of the SA model is computed as the weighted sum of the set of the SD mean vectors, while the covariance matrix is the same as that of the SI model. An algorithm to estimate the weight parameters is given which maximizes the likelihood of the SA model given the adaptation data. Experiments show that 3 adaptation sentences can give a signaificant performance improvement. As the number of SD models increases, further improvement can be obtained.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Rapid Speaker Adaptation With Speaker Clustering

This thesis addresses the problem of rapid speaker adaptation. This is the task of altering the parameters of a speaker dependent speech recognition system so as to make that system look more like a speaker dependent system using a very small amount (<10 seconds) of speaker specific data. The approach to speaker adaptation taken in this work is called speaker cluster weighting (SCW). SCW extend...

متن کامل

Linguistic tree based maximum likelihood model interpolation

In this paper, a speaker adaptation method is presented which computes the speaker adapted model by a weighted sum of a set of speaker dependent models. The set of weights are estimated to maximize the likelihood of the adaptation data. Then a linguistic tree is constructed to cluster the mean vectors. The means in the same linguistic class share the same weight set, while the means in differen...

متن کامل

Cluster adaptive training for speech recognition

When performing speaker adaptation there are two conicting requirements. First the transform must be powerful enough to represent the speaker. Second the transform must be quickly and easily estimated for any particular speaker. Recently the most popular adaptation schemes have used many parameters to adapt the models. This limits how rapidly the models may be adapted. This paper examines an ad...

متن کامل

Discriminative Linear Transforms for Speaker Adaptation

Linear transform adaptation techniques such as Maximum Likelihood Linear Regression (MLLR) are a popular and effective family of methods for speaker adaptation. MLLR estimates transform parameters for Gaussian means and variances using a maximum likelihood (ML) objective function. This paper discusses the use of an alternative discriminative objective function for linear transform estimation, w...

متن کامل

Improvement of MLLR Speaker Adaptation Using a Novel Method

This paper presents a technical speaker adaptation method called WMLLR, which is based on maximum likelihood linear regression (MLLR). In MLLR, a linear regression-based transform which adapted the HMM mean vectors was calculated to maximize the likelihood of adaptation data. In this paper, the prior knowledge of the initial model is adequately incorporated into the adaptation. A series of spea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999