Verifying LVCSR Output at Different Levels with Generalized Posterior Probability
نویسندگان
چکیده
Generalized posterior probability (GPP), a statistical confidence measure, is used for verification of large vocabulary continuous speech recognition (LVCSR) output at subword, word and utterance levels. GPP is obtained by combining exponentially and optimally weighted products of acoustic and language model scores for reappeared units in the reduced search space (e.g., word graph). Experimental results have demonstrated the effectiveness of GPP for verifying LVCSR output at all three levels. Keyword confidence measure, posterior probability, large vocabulary continuous speech recognition 1 The author is now with Microsoft Research Asia.
منابع مشابه
Generalized Posterior Probability for Verifying Recognized Words Optimally in Microphone Array Applications
In a large vocabulary, continuous speech recognition (LVCSR) system, spoken input is converted into a string of hypothesized, possibly erroneous, words. However, the current state-of-the-art speech recognition technology is still not robust to all variability in speech signals, especially in a hands-free application. To make the signal pick-up from a speech source more immuned to noise or room ...
متن کاملContext constrained-generalized posterior probability for verifying phone transcriptions
A new statistical confidence measure, Context ConstrainedGeneralized Posterior probability (CC-GPP), is proposed for verifying phone transcriptions in speech databases. Different from generalized posterior probability (GPP), CC-GPP is computed by considering string hypotheses that bear a focused phone with partially matched left and right contexts. Parameters used for CC-GPP include context win...
متن کاملBackground model based posterior probability for measuring confidence
Word posterior probability (WPP) computed over LVCSR word graphs has been used successfully in measuring confidence of speech recognition output. However, for certain applications the word graph is too sparse to warrant reliable WPP estimation. In this paper, we incorporate subword units as background models to generate a subword graph for estimating posterior probability. Experiments on both E...
متن کاملConfidence measures from local posterior probability estimates
In this paper we introduce a set of related confidence measures for large vocabulary continuous speech recognition (LVCSR) based on local phone posterior probability estimates output by an acceptor HMM acoustic model. In addition to their computational efficiency, these confidence measures are attractive as they may be applied at the state-, phone-, wordor utterance-levels, potentially enabling...
متن کاملPerformance improvement of dialog speech translation by rejecting unreliable utterances
We discuss how to measure the reliability of recognized utterances based on a confidence measure, and applied it to a dialog speech translation system. In this study, we employ generalized word posterior probability (GWPP), a confidence measure for verifying recognized words, and expand it to measure the reliability of recognized utterances. We confirmed the performance improvement by applying ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004