A feature-based hierarchical speech recognition system for Hindi
نویسندگان
چکیده
منابع مشابه
Ensemble Feature Extraction Modules for Improved Hindi Speech Recognition System
Speech is the most natural way of communication between human beings. The field of speech recognition generates intrigues of man – machine conversation and due to its versatile applications; automatic speech recognition systems have been designed. In this paper we are presenting a novel approach for Hindi speech recognition by ensemble feature extraction modules of ASR systems and their outputs...
متن کاملA large-vocabulary continuous speech recognition system for Hindi
In this paper we present a Hindi Speech Recognition system which has been trained on 40 hours of audio data and a has a trigram language model that is trained with 3 million words. For a vocabulary size of 65000 words, the system gives a word accuracy of 75% to 95%.
متن کاملHindi Speech Recognition System Using Htk
Speech recognition is the process of converting an acoustic waveform into the text similar to the information being conveyed by the speaker. In the present era, mainly Hidden Markov Model (HMMs) based speech recognizers are used. This paper aims to build a speech recognition system for Hindi language. Hidden Markov Model Toolkit (HTK) is used to develop the system. It recognizes the isolated wo...
متن کاملUsing Gaussian Mixtures for Hindi Speech Recognition System
The goal of automatic speech recognition (ASR) system is to accurately and efficiently convert a speech signal into a text message independent of the device, speaker or the environment. In general the speech signal is captured and pre-processed at front-end for feature extraction and evaluated at back-end using the Gaussian mixture hidden Markov model. In this statistical approach since the eva...
متن کاملAn auditory system-based feature for robust speech recognition
An auditory feature extraction algorithm for robust speech recognition in adverse acoustic environments is presented. The feature computation is comprised of an outer-middle-ear transfer function, FFT, frequency conversion from linear to the Bark scale, auditory filtering, nonlinearity, and discrete cosine transform. The feature is evaluated in two tasks: connected-digit recognition and large v...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Sadhana
سال: 1998
ISSN: 0256-2499,0973-7677
DOI: 10.1007/bf02745745