Out-Of-Vocabulary Detection and Confidence Measures for Speech Recognition Using Phone Models

نویسندگان

  • Arlindo Veiga
  • Cláudio Neves
  • Fernando Perdigão
  • Luís Sá
چکیده

This paper describes a fast and efficient method to detect out-of-vocabulary words and compute confidence measures in a command-based speech recognition system. The method uses a phone-loop model to reject out-of-vocabulary words and a filler model to compute a confidence measure for each accepted word present in the recognizer output. Tests with this method show that it achieves a good trade-off between falseacceptance versus false-rejection rate. The system runs in real time in a platform with low computational resources and operates in noisy environment conditions (industrial environments and inside vehicles).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic out-of-language detection based on confidence measures derived from LVCSR word and phone lattices

Confidence Measures (CMs) estimated from Large Vocabulary Continuous Speech Recognition (LVCSR) outputs are commonly used metrics to detect incorrectly recognized words. In this paper, we propose to exploit CMs derived from frame-based word and phone posteriors to detect speech segments containing pronunciations from non-target (alien) languages. The LVCSR system used is built for English, whic...

متن کامل

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Islamic Republic of Iran Broadcasting (IRIB) as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, the IRIBchr('39')s archive is one of the richest archives in Iran containing a huge amount of multimedia data. Monitoring this massive volume of data, and brows and retrieval of this archive is one of the key issues for this broadcasting...

متن کامل

Confidence measures from local posterior probability estimates

In this paper we introduce a set of related confidence measures for large vocabulary continuous speech recognition (LVCSR) based on local phone posterior probability estimates output by an acceptor HMM acoustic model. In addition to their computational efficiency, these confidence measures are attractive as they may be applied at the state-, phone-, wordor utterance-levels, potentially enabling...

متن کامل

Confidence measures for hybrid HMM/ANN speech recognition

In this paper we introduce four acoustic confidence measures which are derived from the output of a hybrid HMM/ANN large vocabulary continuous speech recognition system. These confidence measures, based on local posterior probability estimates computed by an ANN, are evaluated at both phone and word levels, using the North American Business News corpus.

متن کامل

Weighting Phone Confidence Measures for Automatic Speech Recognition

One of the most useful applications of Confidence Measures (CMs) in Automatic Speech Recognition systems is early detection of incorrect recognition hypotheses. A purely acoustic basis for such a CM is particularly important when tracking errors resulting from Out of Vocabulary speech, background noise or keyword substitution. A commonly taken approach is to compute scores on subword units of t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009