Document Image Binarization Technique for Degraded Document Images
Abstract
Document image binarization is a vital pre-processing step in document image analysis that segments text from badly degraded document images. In this paper, we propose a robust document image binarization technique based on adaptive image contrast. The adaptive image contrast, formed by combining the local image contrast and the local image gradient, is tolerant to the text and background variation caused by different types of document degradation. In the proposed technique, the adaptive contrast map is binarized and text stroke edge pixels are detected using Canny's algorithm. The document text is then segmented by a local threshold that is estimated from the intensities of the detected text stroke edge pixels within a local window. The same process is repeated with the adaptive image contrast combined with Sobel's edge detection technique and with the Total Variation edge detection technique, respectively. The techniques are then compared on the basis of Peak Signal-to-Noise Ratio (PSNR) and Mean Square Error (MSE) values. These methods have been tested on images suffering from different types of degradation. Adaptive image contrast used with Canny's edge detection technique was found to give the best results.
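The abstract compares the three edge-detection variants by PSNR and MSE but does not spell out how these are computed. The following is a minimal sketch of the standard definitions, assuming 8-bit images stored as lists of rows and a peak value of 255. The function names `mse` and `psnr` and the toy images are illustrative only and do not come from the paper.

```python
import math


def mse(reference, result):
    """Mean square error between two equally sized grayscale images.

    Both images are lists of rows of pixel intensities (an assumed layout).
    """
    if len(reference) != len(result) or any(
        len(a) != len(b) for a, b in zip(reference, result)
    ):
        raise ValueError("images must have the same shape")
    total = 0.0
    count = 0
    for ref_row, res_row in zip(reference, result):
        for r, s in zip(ref_row, res_row):
            total += (r - s) ** 2
            count += 1
    if count == 0:
        raise ValueError("images must not be empty")
    return total / count


def psnr(reference, result, peak=255.0):
    """Peak signal-to-noise ratio in dB: 10 * log10(peak^2 / MSE)."""
    err = mse(reference, result)
    if err == 0:
        return math.inf  # identical images: no error to measure
    return 10.0 * math.log10(peak ** 2 / err)


# Toy 2x2 example (hypothetical data): one of four pixels is misclassified.
ground_truth = [[0, 255], [255, 255]]
binarized = [[0, 255], [0, 255]]
print("MSE :", mse(ground_truth, binarized))
print("PSNR:", round(psnr(ground_truth, binarized), 4), "dB")
```

A lower MSE and a higher PSNR both indicate that the binarized output is closer to the ground-truth image, which is the sense in which the abstract ranks the three variants.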
Similar resources
Ancient Document Images Enhancement Using Phase Based Binarization
In this paper, we present a phase-based binarization model for degraded document images, as well as a post-processing method that can improve any binarization method and a ground truth generation tool. Many binarization techniques have been implemented in the literature for different types of binarization problems. They include an adaptive image contrast based document image binarization technique t...
A Survey on Degraded Document Image Binarization Techniques
Segmentation in image binarization is the main technique used to separate pixel values into two classes, black as foreground and white as background. Degraded document images are segmented with image binarization in order to recover clear images that match the original documents. Thresholding process is t...
An Improved Contrast Image Based Document Image Binarization Technique for Degraded Document Images
Document image binarization converts a gray-scale document image into a binary document image. It is usually performed in the pre-processing stage of document image analysis and it aims to segment the foreground text from the document background. Segmentation of foreground text from the document background is a difficult task in the case of degraded document images. In this paper we propose a sim...
Binarization of Document Image
Document image binarization is performed in the preprocessing stage for document analysis and it aims to segment the foreground text from the document background. A fast and accurate document image binarization technique is important for the ensuing document image processing tasks such as optical character recognition (OCR). Though document image binarization has been studied for many years, t...
A Robust Document Image Binarization Technique for Degraded Document Images
Segmentation of text from badly degraded document images is a very challenging task due to the high inter/intravariation between the document background and the foreground text of different document images. In this paper, we propose a novel document image binarization technique that addresses these issues by using adaptive image contrast. The adaptive image contrast is a combination of the loca...
An Analysis of Image Denoising and Restoration of Handwritten Degraded Document Images
The restoration of a blurry or noisy image is commonly performed with a MAP estimator, which maximizes a posterior probability to reconstruct a clean image from a degraded image. A MAP estimator, when used with a sparse gradient image prior, reconstructs piecewise smooth images and typically removes textures that are important for visual realism. The three public datasets that were used in the ...
Publication date: 2015