Slant estimation and core-region detection for handwritten Latin words
نویسندگان
چکیده
In this paper, we present a new technique that estimates the slant in handwritten words while a new word core-region detection method is introduced as part of the proposed technique. The proposed core-region detection algorithm can be also used independently to detect the upper and lower baselines of a word. Our method takes advantage of the orientation of the nonhorizontal strokes of Latin characters as well as their location regarding to the word’s core-region. As a first step, the word core-region is detected with the use of novel reinforced horizontal black run profiles which permits to detect the coreregion scan lines more accurately. Then, the near-horizontal parts of the document word are extracted and the orientation and the height of non-horizontal remaining fragments as well as their location in relation to the word’s core-region are calculated. Word slant is estimated taking into consideration the orientation and the height of each fragment while an additional weight is applied if a fragment is partially outside the core-region of the word which indicates that this fragment corresponds to a part of the character stroke that has a significant contribution to the overall word slant and should by definition be vertical to the orientation of the word. Extensive experimental results prove the efficiency of the proposed slant estimation method compared to current state-of-the-art algorithms. 2012 Elsevier B.V. All rights reserved.
منابع مشابه
Nonuniform Slant Correction for Handwritten Word Recognition
Slant correction is a preprocessing technique to improve segmentation and recognition accuracy for handwritten word recognition. All conventional slant correction techniques were performed by the estimation of the average slant angle and the shear transformation. In this paper, a nonuniform slant correction technique for handwritten word recognition is proposed where the slant correction proble...
متن کاملAccuracy Improvement of Slant Estimation for Handwritten Words
Handwritten words are usually slant or Italicized due to the mechanism of handwriting and the personality. In order to improve the accuracy of character segmentation and recognition, the authors proposed a chain code method for the slant estimation and correction[1]. However the method is very simple and usually gives good estimate of the word slant, there was a problem such that the slant tend...
متن کاملSlant Correction for Offline Handwritten Telugu Isolated Characters and Cursive Words
Slant correction is one of the preprocessing steps for the character recognition. This proposed method is simple and efficient for correcting the slant of offline handwritten Telugu isolated characters and cursive words. This method is motivated from the chain code of the word contour slant correction method but this proposed method does not identify the chain code of the word or character cont...
متن کاملComponent-based Segmentation of Words from Handwritten Arabic Text
Efficient preprocessing is very essential for automatic recognition of handwritten documents. In this paper, techniques on segmenting words in handwritten Arabic text are presented. Firstly, connected components (ccs) are extracted, and distances among different components are analyzed. The statistical distribution of this distance is then obtained to determine an optimal threshold for words se...
متن کاملConnected Component Based Word Spotting on Persian Handwritten image documents
Word spotting is to make searchable unindexed image documents by locating word/words in a doc-ument image, given a query word. This problem is challenging, mainly due to the large numberof word classes with very small inter-class and substantial intra-class distances. In this paper, asegmentation-based word spotting method is presented for multi-writer Persian handwritten doc-...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Pattern Recognition Letters
دوره 35 شماره
صفحات -
تاریخ انتشار 2014