A Complete Bangla OCR System for Printed Chracters
نویسندگان
چکیده
Bangla character recognition is very important field of research because Bangla is most popular language in the Indian subcontinent. Research on Bangla character recognition has been started since mid 1980’s. Different types of techniques already applied and the performance is examined. This paper is a complete Optical Character Recognition (OCR) system for printed Bangla characters. Preprocessing steps includes binarization, noise removal, skew detection and correction, segmentation in various levels and scaling. Features are extracted from scaled character and Freeman chain code used for representing a character. Multilayer feed forward neural network is used to classify and recognize character.
منابع مشابه
A Survey on Script Segmentation for Bangla OCR
Script segmentation is an important primary task for any Optical Character Recognition (OCR) software. Especially, in case of off-line OCR for printed character, it has more importance. Through script segmentation a big image of some written document is fragmented into a number of small pieces which are then used for pattern matching to determine the expected sequence of characters. In the impl...
متن کاملA Complete Workflow for Development of Bangla OCR
Developing a Bangla OCR requires bunch of algorithm and methods. There were many effort went on for developing a Bangla OCR. But all of them failed to provide an error free Bangla OCR. Each of them has some lacking. We discussed about the problem scope of currently existing Bangla OCR‟s. In this paper, we present the basic steps required for developing a Bangla OCR and a complete workflow for d...
متن کاملA Complete Machine printed Gurmukhi OCR System
Recognition of Indian language scripts is a challenging problem. Work for the development of complete OCR systems for Indian language scripts is still in infancy. Complete OCR systems have recently been developed for Devanagri and Bangla scripts. Research in the field of recognition of Gurmukhi script faces major problems mainly related to the unique characteristics of the script like connectiv...
متن کاملMachine-printed and hand-written text lines identification
There are many types of documents where machine-printed and handwritten texts intermixedly appear. Since the optical character recognition (OCR) methodologies for machine-printed and handwritten texts are dierent, to achieve optimal performance it is necessary to separate these two types of texts before feeding them to their respective OCR systems. In this paper, we present a machine-printed a...
متن کاملHandwritten Bangla Alphabet Recognition using an MLP Based Classifier
The work presented here involves the design of a Multi Layer Perceptron (MLP) based classifier for recognition of handwritten Bangla alphabet using a 76 element feature set Bangla is the second most popular script and language in the Indian subcontinent and the fifth most popular language in the world. The feature set developed for representing handwritten characters of Bangla alphabet includes...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010