Structure Extraction from Decorated Characters Using Multiscale Images

نویسندگان

  • Shinichiro Omachi
  • Masaki Inoue
  • Hirotomo Aso
چکیده

Decorated characters are widely used in various documents. Practical optical character reader is required to deal with not only common fonts but also complex designed fonts. However, since appearances of decorated characters are complicated, most general character recognition systems cannot give good performances on decorated characters. In this paper, an algorithm that can extract character’s essential structure from a decorated character is proposed. This algorithm is applied in preprocessing of character recognition. The proposed algorithm consists of three procedures: global structure extraction, interpolation of structure and smoothing. By using multi-scale images, topographical features such as ridges and ravines are detected for structure extraction. Ridges are used for extracting global structure, and ravines are used for interpolation. Experimental results show character structures are clearly extracted from very complex decorated characters. Keywords—character recognition, OCR, decorated character, structure extraction

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Structure Extraction from Various Kinds of Decorated Characters Using Multi-Scale Images

Decorated characters are widely used in various documents. Practical optical character reader is required to deal with not only common fonts but also complex designed fonts. However, since appearances of decorated characters are complicated, most general character recognition systems cannot give good performances on decorated characters. In this paper, an algorithm that can extract character’s ...

متن کامل

The Extraction of Characters from Scene Image Using Mathematical Morphology

In understanding an image, the extraction of charcters existing in the image is considered to be important. Scene images are differ from document images, which are composed of characters and complicated background (i.e.photo, picture, or painting etc.) instead of white one, this make it difficult to be dealt with. In this paper, we introduce a new method t o extract characters from scene images...

متن کامل

Zernike Moment Feature Extraction for Handwritten Devanagari (Marathi) Compound Character Recognition

Compound character recognition of Devanagari script is one of the challenging tasks since the characters are complex in structure and can be modified by writing combination of two or more characters. These compound characters occurs 12 to 15% in the Devanagari Script. The moment based techniques are being successfully applied to several image processing problems and represents a fundamental too...

متن کامل

Robust extraction of characters from color scene image using mathematical morphology

Current character extraction systems for scene images are not robust for most real-world applications. In contrast, the system present here achieves robust performance by using morphological segmentation. This paper describes a new morphological segmentation algorithm { Di erential Top-hats (DTT). In addition, a complete system for extraction of characters from color scene images is presented. ...

متن کامل

Handwritten Character Recognition Using Multiscale Neural Network Training Technique

Advancement in Artificial Intelligence has lead to the developments of various “smart” devices. Character recognition device is one of such smart devices that acquire partial human intelligence with the ability to capture and recognize various characters in different languages. Firstly multiscale neural training with modifications in the input training vectors is adopted in this paper to acquir...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEEE Trans. Pattern Anal. Mach. Intell.

دوره 23  شماره 

صفحات  -

تاریخ انتشار 2001