Automatic Speech Recognition of Urdu Digits with Optimal Classification Approach
Abstract
Speech recognition for the Urdu language is an interesting but underdeveloped task, primarily because linguistic resources such as rich corpora are not available for Urdu. A few attempts have nevertheless been made to develop Urdu speech recognition frameworks using traditional approaches such as Hidden Markov Models and Neural Networks. In this work, we investigate three classification methods for the Urdu speech recognition task. We extract the Mel Frequency Cepstral Coefficients (MFCCs) together with their delta and delta-delta features from the speech data and train the classifiers to perform Urdu speech recognition. We report the performance of a Support Vector Machine (SVM) classifier and, for comparison with SVM, a random forest (RF) classifier and a linear discriminant analysis (LDA) classifier. The experimental results show that SVM performs better than the RF and LDA classifiers on this particular task.
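The abstract does not say how the delta and delta-delta features are computed. As a rough illustration only, the sketch below uses the common regression-based delta formula with a window of two frames on each side; the window width, the edge handling, the function names `delta` and `stack_features`, and the toy input are all assumptions made here, not details taken from the paper.

```python
def delta(frames, width=2):
    """Regression-based delta of a sequence of feature frames.

    frames: list of equal-length lists, one list per time step.
    width:  frames used on each side of the current one (assumed value).
    """
    if not frames:
        return []
    n_frames = len(frames)
    denom = 2 * sum(n * n for n in range(1, width + 1))
    out = []
    for t in range(n_frames):
        row = []
        for k in range(len(frames[0])):
            acc = 0.0
            for n in range(1, width + 1):
                # Clamp indices so the first/last frame is repeated at the edges.
                ahead = frames[min(t + n, n_frames - 1)][k]
                behind = frames[max(t - n, 0)][k]
                acc += n * (ahead - behind)
            row.append(acc / denom)
        out.append(row)
    return out


def stack_features(static):
    """Concatenate static, delta, and delta-delta coefficients per frame."""
    d1 = delta(static)   # first-order differences (delta)
    d2 = delta(d1)       # delta of the delta (delta-delta)
    return [s + a + b for s, a, b in zip(static, d1, d2)]


# Toy input: 5 frames of 2 made-up coefficients (not real MFCCs).
toy = [[0.0, 1.0], [1.0, 1.0], [2.0, 1.0], [3.0, 1.0], [4.0, 1.0]]
features = stack_features(toy)
```

Each row of `features` holds the static coefficients followed by their delta and delta-delta values, which is the kind of per-frame vector that could then be passed to an SVM, RF, or LDA classifier.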
Similar resources
Linear Discriminant Analysis Based Approach for Automatic Speech Recognition of Urdu Isolated Words
Urdu is amongst the five largest languages of the world and enjoys extreme importance by sharing its vocabulary with several other languages of the Indo-Pak. However, there has not been any significant research in the area of Automatic Speech Recognition of Urdu. This paper presents the statistical based classification technique to achieve the task of Automatic Speech Recognition of isolated wo...
DWT features performance analysis for automatic speech recognition of Urdu
This paper presents work on Automatic Speech Recognition of the Urdu language, using a comparative analysis of Discrete Wavelet Transform (DWT) based features and Mel Frequency Cepstral Coefficients (MFCC). These features have been extracted for one hundred isolated words of Urdu, each word uttered by ten different speakers. The words have been selected from the most frequently used words of ...
Automatic Recognition of Offline Handwritten Urdu Digits In Unconstrained Environment Using Daubechies Wavelet Transforms
This paper presents an optical character recognition system for handwritten Urdu digits. A lot of work has been done on recognizing the characters and numerals of various languages such as Devanagari, English, Chinese, and Arabic, but very little work has been reported for handwritten Urdu digits. Different Daubechies Wavelet transforms are used in this work for feature extraction. Als...
Accent Classification among Punjabi, Urdu, Pashto, Saraiki and Sindhi Accents of Urdu Language
Automatic Speech Recognition (ASR) is a key component in Human Computer Interaction (HCI) applications. Stability of ASR systems largely depends on accent, gender, age of speakers, background noise and channel variations. In this paper, a study has been conducted to classify five different accents of Urdu language spoken in Pakistan i.e. Punjabi, Urdu, Pashto, Saraiki and Sindhi. Speech data ha...
A New Large Urdu Database for Off-Line Handwriting Recognition
A new large Urdu handwriting database, which includes isolated digits, numeral strings with/without decimal points, five special symbols, 44 isolated characters, 57 Urdu words (mostly financial related), and Urdu dates in different patterns, was designed at Centre for Pattern Recognition and Machine Intelligence (CENPARMI). It is the first database for Urdu off-line handwriting recognition. It ...
Publication date: 2015