Detection of Bold and Italic Character in Gurmukhi Script

نویسنده

  • Harjit Singh
چکیده

Working with Optical Character Recognition for the printed Gurmukhi Script is a challenging task due to the large number of characters, the sophisticated ways in which they combine, and the complicated result. This paper describes a fast and easy to implement algorithm for detection of bold and italic character in Gurmukhi Script. The algorithm works without recognition of actual character and detects the font style (bold or italic) in the way of weight and slope. The procedure of identification and classification of bold and italic character can be used to improve character recognition. This simple and fast algorithm gives high accuracy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Detection of Bold and Italic Character in Devanagari Script

Only a few works has been done for printed devanagari text in the area of optical character recognition. In this paper there is describing about a simple and fast algorithm for detection of italic and bold character in Devanagari script, without recognition of actual character. Here present an automatic information which tells us about the font type phase in the way of weight and slope. The pro...

متن کامل

Word Disambiguation in Shahmukhi to Gurmukhi Transliteration

To write Punjabi language, Punjabi speakers use two different scripts, Perso-Arabic (referred as Shahmukhi) and Gurmukhi. Shahmukhi is used by the people of Western Punjab in Pakistan, whereas Gurmukhi is used by most people of Eastern Punjab in India. The natural written text in Shahmukhi script has missing short vowels and other diacritical marks. Additionally, the presence of ambiguous chara...

متن کامل

A Hybrid Approach to Classify Gurmukhi Script Characters

Researchers have worked extensively on OCR, in the past few decades. This is also visible from the fact that various types of OCR are available in the market. Out of these available OCR’s majority is to support foreign languages. In Indian context, majority of available OCR’s are for Hindi and Bangla, but a very few reports are available on Gurmukhi script which is used to write Punjabi languag...

متن کامل

Segmentation Problems and Solutions in Printed Degraded Gurmukhi Script

Character segmentation is an important preprocessing step for text recognition. In degraded documents, existence of touching characters decreases recognition rate drastically, for any optical character recognition (OCR) system. In this paper we have proposed a complete solution for segmenting touching characters in all the three zones of printed Gurmukhi script. A study of touching Gurmukhi cha...

متن کامل

Conversion between Scripts of Punjabi: Beyond Simple Transliteration

This paper describes statistical techniques used for modelling transliteration systems between the scripts of Punjabi language. Punjabi is one of the unique languages, which are written in more than one script. In India, Punjabi is written in Gurmukhi script, while in Pakistan it is written in Shahmukhi (Perso-Arabic) script. Shahmukhi script has its origin in the ancient Phoenician script wher...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012