Combining Neural Networks and Hidden Markov Models M for Continuous Speech Recognition
نویسندگان
چکیده
Pure MLP-based approaches have not previously been t demonstrated to function well for continuous-speech recogni ion because of the need for accurate segmentation of the w speech signal. HMMs, on the other hand, provide a frame ork for simultaneous segmentation and classification of t speech, which has been demonstrated to be useful for con inuous recognition. Previous work by Morgan and Bourlard d H [1] has shown theoretically and practically that MLPs an MMs can be combined by using MLPs for the estimation , t of the HMM state-dependent observation probabilities hereby exploiting the advantages of both approaches.
منابع مشابه
Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM
Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...
متن کاملشبکه عصبی پیچشی با پنجرههای قابل تطبیق برای بازشناسی گفتار
Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...
متن کاملAn Automatic Dysarthric Speech Recognition Approach using Deep Neural Networks
Transcribing dysarthric speech into text is still a challenging problem for the state-of-the-art techniques or commercially available speech recognition systems. Improving the accuracy of dysarthric speech recognition, this paper adopts Deep Belief Neural Networks (DBNs) to model the distribution of dysarthric speech signal. A continuous dysarthric speech recognition system is produced, in whic...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1992