Context-dependent hybrid HME/HMM speech recognition using polyphone clustering decision trees

نویسندگان

  • Jürgen Fritsch
  • Michael Finke
  • Alexander H. Waibel
چکیده

This paper presents a context-dependent hybrid connectionist speech recognition system that uses a set of generalized hierarchical mixtures of experts (HME) to estimate context-dependent posterior acoustic class probabilities. The connectionist part of the system is organized in a modular fashion, allowing the distributed training of such a system on regular workstations. Context classes are based on polyphonic contexts, clustered using decision trees which we adopt from our continuous density HMM recognizer JANUS [8]. The system is evaluated on ESST, an english speaker-independent spontaneous speech database. Context dependent modeling is shown to yield signi cant improvements over simple context-independent modeling, requiring only small additional overhead in terms of training and decoding time.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Acoustic modeling based on model structure annealing for speech recognition

This paper proposes an HMM training technique using multiple phonetic decision trees and evaluates it in speech recognition. In the use of context dependent models, the decision tree based context clustering is applied to find a parameter tying structure. However, the clustering is usually performed based on statistics of HMM state sequences which are obtained by unreliable models without conte...

متن کامل

Context-dependent HMM modeling using tree-based clustering for the recognition of handwritten words

This paper presents an HMM-based recognizer for the off-line recognition of handwritten words. Word models are the concatenation of context-dependent character models (trigraphs). The trigraph models we consider are similar to triphone models in speech recognition, where a character adapts its shape according to its adjacent characters. Due to the large number of possible context-dependent mode...

متن کامل

Hierarchical mixtures of experts methodology applied to continuous speech recognition

In this paper, we incorporate the Hierarchical Mixtures of Experts (HME) method of probability estimation, developed by Jordan [1], into an HMMbased continuous speech recognition system. The resulting system can be thought of as a continuous-density HMM system, but instead of using gaussian mixtures, the HME system employs a large set of hierarchically organized but relatively small neural netw...

متن کامل

Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition

Decision tree based context clustering [Young; '94] ・ Construct a parameter tying structure ・ Can estimate robust parameter ・ Can generate unseen context dependent models ・ Minimum description length (MDL) criterion [Shinoda; '97] Bayesian approach ・ Variational Bayesian (VB) method [Attias; '99] ⇒ Applied to speech recognition [Watanabe; '04] ・ Can use prior information ⇒ Affect context cluste...

متن کامل

Acoustic modeling based on the MDL principle for speech recognition

Recently context-dependent phone units, such as triphones, have been used to model subword units in speech recognition based on Hidden Markov Models (HMMs). While most such methods employ clustering of the HMM parameters(e.g., subword clustering, state clustering, etc.), to control HMM size so as to avoid poor recognition accuracy due to an insu ciency of training data, none of them provide any...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997