Scene Text Detection Based on Multi-Headed Self-Attention Using Shifted Windows
نویسندگان
چکیده
Scene text detection has become a popular topic in computer vision research. Most of the current research is based on deep learning, using Convolutional Neural Networks (CNNs) to extract visual features images. However, due limitations convolution kernel size, CNNs can only local images with small perceptual fields, and they cannot obtain more global features. In this paper, improve accuracy scene detection, feature enhancement module added model. This acquires an image by computing multi-headed self-attention map. The improved model extracts CNNs, while extracting through module. extracted both these are then fused ensure that at different levels extracted. A shifted window used calculation self-attention, which reduces computational complexity from second power input width-height product first power. Experiments conducted multi-oriented dataset ICDAR2015 multi-language MSRA-TD500. Compared pre-improvement method DBNet, F1-score improves 0.5% 3.5% MSRA-TD500, respectively, indicating effectiveness improvement.
منابع مشابه
Fused Text Segmentation Networks for Multi-oriented Scene Text Detection
In this paper, we introduce a novel end-end framework for multi-oriented scene text detection from an instanceaware segmentation perspective. We present Fused Text Segmentation Networks, which combine multi-level features during feature extracting as text instance may rely on finer feature expression compared to general objects. It detects and segments the text instance jointly and simultaneous...
متن کاملNatural scene text localization using edge color signature
Localizing text regions in images taken from natural scenes is one of the challenging problems dueto variations in font, size, color and orientation of text. In this paper, we introduce a new concept socalled Edge Color Signature for localizing text regions in an image. This method is able to localizeboth Farsi and English texts. In the proposed method rst a pyramid using diff...
متن کاملText Detection in Multi-Oriented Natural Scene Images
---------------------------------------------------------------------***--------------------------------------------------------------------Abstract With the growing number of digital multimedia libraries, the need to efficiently index, browse and retrieve multimedia information is increased. Text embedded in images and video frames can help to identify the image information (e.g. somebody's na...
متن کاملScene Text Detection via Holistic, Multi-Channel Prediction
Recently, scene text detection has become an active research topic in computer vision and document analysis, because of its great importance and significant challenge. However, vast majority of the existing methods detect text within local regions, typically through extracting character, word or line level candidates followed by candidate aggregation and false positive elimination, which potent...
متن کاملObject Attention Patches for Text Detection and Recognition in Scene Images using SIFT
Natural urban scene images contain many problems for character recognition such as luminance noise, varying font styles or cluttered backgrounds. Detecting and recognizing text in a natural scene is a difficult problem. Several techniques have been proposed to overcome these problems. These are, however, usually based on a bottom-up scheme, which provides a lot of false positives, false negativ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2023
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app13063928