Recognition of Superimposed Caption
Abstract
The automatic extraction and reading of news captions and annotations can be of great help in locating topics of interest in digital news video archives. To achieve this goal, we present a technique, called Video OCR, which detects, extracts, and reads text areas in digital video data. In this paper, we address the problems involved, describe the method by which Video OCR operates, and suggest applications for its use in digital news archives. To solve two problems of character recognition for videos, low-resolution characters and extremely complex backgrounds, we apply an interpolation filter, multi-frame integration, and a combination of four filters. Characters are segmented by a recognition-based segmentation method, and intermediate character recognition results are used to improve the segmentation. We also include a method for locating text areas using text-like properties and a language-based post-processing technique to increase word recognition rates. The overall recognition results are satisfactory for use in news indexing. Performing Video OCR on news video and combining its results with other video understanding techniques will improve the overall understanding of the news video content.
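The abstract names multi-frame integration but does not say how it is computed. As a rough illustration only, the sketch below assumes that captions are bright and stay fixed over several aligned frames while the background changes, so that a per-pixel minimum keeps the caption and darkens the background. The per-pixel minimum rule, the function name `integrate_frames_min`, and the toy frames are assumptions made for this example, not details taken from the abstract.

```python
from typing import List

# A grayscale frame as rows of 0-255 intensities (hypothetical representation).
Frame = List[List[int]]


def integrate_frames_min(frames: List[Frame]) -> Frame:
    """Combine aligned frames by taking the per-pixel minimum intensity.

    Assumption: caption pixels are bright and unchanged across frames,
    while background pixels vary, so the minimum suppresses the background.
    """
    if not frames:
        raise ValueError("need at least one frame")
    height, width = len(frames[0]), len(frames[0][0])
    for frame in frames:
        if len(frame) != height or any(len(row) != width for row in frame):
            raise ValueError("all frames must share the same dimensions")
    return [
        [min(frame[y][x] for frame in frames) for x in range(width)]
        for y in range(height)
    ]


# Toy data: the first column plays the role of a static bright caption pixel,
# the other columns are a changing background.
frames = [
    [[255, 40, 200], [255, 180, 90]],
    [[255, 120, 60], [255, 30, 210]],
    [[255, 90, 150], [255, 75, 20]],
]
integrated = integrate_frames_min(frames)
```

In this toy case the caption column keeps its full intensity while every background pixel drops to its darkest observed value, which raises the contrast between text and background before any further filtering.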
Similar resources
Effects of Closed-caption Programs on EFL Learners’ Listening Comprehension and Vocabulary Learning
This study aimed at investigating the impact of closed-caption program on listening comprehension of English movies and vocabulary learning. Sixty-four graduate students studying at Shiraz Islamic Azad University were selected as the participants of the study. The participants were divided into two groups: experimental group (with closed caption program) and control group (without closed captio...
General and domain-specific techniques for detecting and recognizing superimposed text in video
We have developed generic and domain-specific video algorithms for caption text extraction and recognition in digital video. Our system includes several unique features: for caption box location, we combine the compressed-domain features derived from DCT coefficients and motion vectors. Long-term temporal consistency is employed to enhance localization performance. For character segmentation, w...
A spatial-temporal approach for video caption detection and recognition
We present a video caption detection and recognition system based on a fuzzy-clustering neural network (FCNN) classifier. Using a novel caption-transition detection scheme we locate both spatial and temporal positions of video captions with high precision and efficiency. Then employing several new character segmentation and binarization techniques, we improve the Chinese video-caption recogniti...
Application of natural language processing and speech processing technology to production of closed-caption TV programs for the hearing impaired
Television service is indispensable to human life in the modern age. People who are visually or hearing impaired, however, cannot enjoy TV programs as much as they would like. In closed-caption service, the speech in TV programs is converted into text and superimposed on TV pictures for the benefit of hearing impaired people. We proposed in the paper natural language and speech processing tec...
Text Detection in Images and Video Sequences
Caption text or superimposed text provides valuable information about contents in images and video sequences. In this paper, on one hand we present a general overview about text features and a classification of its extraction methods, and on the other hand we introduce our tree structure-based bottom-up approach to text extraction showing some promising results. The purpose of this work is to d...
Publication date: 1998