Scene text detection has become a popular topic in computer vision research. Most current research is based on deep learning, using Convolutional Neural Networks (CNNs) to extract visual features from images. However, due to the limitations of convolution kernel size, CNNs can only extract local features of images with small receptive fields, and they cannot obtain more global features. In this paper, to improve the accuracy of sc...