closed caption program

Efficient Archiving and Content-Based Retrieval of Video Information on the Web

1997

Behzad Shahraray David C. Gibbon

This paper summarizes an ongoing work in multimedia processing aimed at the automated archiving and selective retrieval of textual, pictorial and auditory information contained in video programs. Video processing performs the task of representing the visual information using a small subset of the video frames. Linguistic processing refines the closed caption text, generates table of contents, a...

متن کامل

Automatic Closed Caption Detection and Filtering in MPEG Videos for Video Structuring

Journal: :J. Inf. Sci. Eng. 2006

Duan-Yu Chen Ming-Ho Hsiao Suh-Yin Lee

Video structuring is the process of extracting temporal structural information of video sequences and is a crucial step in video content analysis especially for sports videos. It involves detecting temporal boundaries, identifying meaningful segments of a video and then building a compact representation of video content. Therefore, in this paper, we propose a novel mechanism to automatically pa...

متن کامل

Context-Sensitive Complementary Information Retrieval for Text Stream

2005

Qiang Ma Katsumi Tanaka

With constant advances in information technology, more and more information is available and users’ information needs are becoming more diverse. Most conventional information systems only attempt to provide information that meets users’ specific interests. In contrast, we are working on ways of discovering information from the viewpoints of both interest and necessity. For example, we are tryin...

متن کامل

A Bayesian Framework for Fusing Multiple Word Knowledge Models in Videotext Recognition

2003

DongQing Zhang Shih-Fu Chang

Videotext recognition is challenging due to low resolution, diverse fonts/styles, and cluttered background. Past methods enhanced recognition by using multiple frame averaging, image interpolation and lexicon correction, but recognition using multi-modality language models has not been explored. In this paper, we present a formal Bayesian framework for videotext recognition by combining multipl...

متن کامل

Content-based Indexing of Captioned Video on the Viewstation

1995

David R. Bacher Christopher J. Lindblad

We have designed and constructed a mechanism for using caption text from broadcast television programs to analyze their content. In this paper, we describe the method by which captions are captured and translated from the raw video signal into text on the ViewStation. We also describe our Caption Parser, which analyzes the text and extracts information about the content of broadcast television ...

متن کامل

Discriminative data selection for lightly supervised training of acoustic model using closed caption texts

2015

Sheng Li Yuya Akita Tatsuya Kawahara

We present a novel data selection method for lightly supervised training of acoustic model, which exploits a large amount of data with closed caption texts but not faithful transcripts. In the proposed scheme, a sequence of the closed caption text and that of the ASR hypothesis by the baseline system are aligned. Then, a set of dedicated classifiers is designed and trained to select the correct...

متن کامل

Automatic Lecture Transcription Based on Discriminative Data Selection for Lightly Supervised Acoustic Model Training

Journal: :IEICE Transactions 2015

Sheng Li Yuya Akita Tatsuya Kawahara

The paper addresses a scheme of lightly supervised training of an acoustic model, which exploits a large amount of data with closed caption texts but not faithful transcripts. In the proposed scheme, a sequence of the closed caption text and that of the ASR hypothesis by the baseline system are aligned. Then, a set of dedicated classifiers is designed and trained to select the correct one among...

متن کامل

Generating Hypermedia Documents from Transcriptions of Television Programs Using Parallel Text Alignment

1998

David C. Gibbon

This paper presents a method of automatically creating hypermedia documents from conventional transcriptions of television programs. Using parallel text alignment techniques, the temporal information derived from the closed caption signal is exploited to convert the transcription into a synchronized text stream. Given this text stream, we can create links between the transcription and the image...

متن کامل

Cognitive experiments on timing lag for superimposing closed captions

1999

Ichiro Maruyama Yoshiharu Abe Eiji Sawamura Tetsuo Mitsuhashi Terumasa Ehara Katsuhiko Shirai

This paper describes cognitive characteristics of timing difference for closed captions superimposed onto TV news programs. It was reported that timing delays for superimposing disrupts hearing impaired people's enjoyment and intelligibility of TV, but nobody has yet investigated the permissible limit for timing difference. This study presents subjects' permissible and preferable limits of the ...

متن کامل

Retrieving of Video Scenes Using Arabic Closed-caption

2010

H. M. Nassar A. Taha T. M. Nazmy K. A. Nagaty

The increased use of video documents for multimedia-based applications has created a demand for strong video database support, including efficient methods for browsing and retrieving video data. Most solutions to video browsing and retrieval of video data rely on visual information only, ignoring the rich source of the accompanying audio signal and texts. Speech is the significant information t...

متن کامل