captioning order

Novel Approach to Live Captioning Through Re-speaking: Tailoring Speech Recognition to Re-speaker's Needs

2012

Ales Prazák Zdenek Loose Jan Trmal Josef V. Psutka Josef Psutka

A novel approach to the live captioning through re-speaking is introduced in this paper. We describe our concept of respeaking using only one re-speaker with enhanced re-speaker tasks fully integrated to the recognition system and captioning software. New techniques for instant correction of recognition output, punctuation mark introduction or new word addition are presented. Our real-time reco...

متن کامل

Multi-Task Video Captioning with Video and Entailment Generation

2017

Ramakanth Pasunuru Mohit Bansal

Video captioning, the task of describing the content of a video, has seen some promising improvements in recent years with sequence-to-sequence models, but accurately learning the temporal and logical dynamics involved in the task still remains a challenge, especially given the lack of sufficient annotated data. We improve video captioning by sharing knowledge with two related directed-generati...

متن کامل

Evaluating automatically generated user-focused multi-document summaries for geo-referenced images

2008

Ahmet Aker Robert J. Gaizauskas

This paper reports an initial study that aims to assess the viability of a state-of-the-art multi-document summarizer for automatic captioning of geo-referenced images. The automatic captioning procedure requires summarizing multiple web documents that contain information related to images’ location. We use SUMMA (Saggion and Gaizauskas, 2005) to generate generic and query-based multi-document ...

متن کامل

Automated Image Captioning Using Nearest-Neighbors Approach Driven by Top-Object Detections

2017

Karan Sharma Arun CS Kumar Suchendra M. Bhandarkar

The significant performance gains in deep learning coupled with the exponential growth of image and video data on the Internet have resulted in the recent emergence of automated image captioning systems. Two broad paradigms have emerged in automated image captioning, i.e., generative model-based approaches and retrieval-based approaches. Although generative model-based approaches that use the r...

متن کامل

From Deterministic to Generative: Multi-Modal Stochastic RNNs for Video Captioning

Journal: :CoRR 2017

Jingkuan Song Yuyu Guo Lianli Gao Xuelong Li Alan Hanjalic Heng Tao Shen

Video captioning in essential is a complex natural process, which is affected by various uncertainties stemming from video content, subjective judgment, etc. In this paper we build on the recent progress in using encoder-decoder framework for video captioning and address what we find to be a critical deficiency of the existing methods, that most of the decoders propagate deterministic hidden st...

متن کامل

Summary Generation for Toponym-referenced Images using Object Type Language Models

2009

Ahmet Aker Robert J. Gaizauskas

This paper presents a novel approach to automatic captioning of toponym-referenced images. The automatic captioning procedure works by summarizing multiple web-documents that contain information related to an image’s location. Our summarizer can generate both query-based and language model-biased multidocument summaries. The models are created from large numbers of existing articles pertaining ...

متن کامل

Aesthetically Relevant Image Captioning

Journal: :Proceedings of the ... AAAI Conference on Artificial Intelligence 2023

Image aesthetic quality assessment (AQA) aims to assign numerical ratings images whilst image captioning (IAC) generate textual descriptions of the aspects images. In this paper, we study AQA and IAC together present a new method termed Aesthetically Relevant Captioning (ARIC). Based on observation that most comments an are about objects their interactions rather than aesthetics, first introduc...

متن کامل

Automated closed-captioning of live TV broadcast news in French

2003

Julie Brousseau Jean-Francois Beaumont Gilles Boulianne Patrick Cardinal Claude Chapdelaine Michel Comeau Frédéric Osterrath Pierre Ouellet

This paper describes the system currently under development at CRIM whose aim is to provide real-time closed captioning of live TV broadcast news in Canadian French. This project is done in collaboration with TVA Network, a national TV broadcaster and the RQST (a Québec association which promotes the use of subtitling). The automated closed-captioning system will use CRIM’s transducer-based lar...

متن کامل

Generating captions without looking beyond objects

Journal: :CoRR 2016

Hendrik Heuer Christof Monz Arnold W. M. Smeulders

This paper explores new evaluation perspectives for image captioning and introduces a noun translation task that achieves comparative image caption generation performance by translating from a set of nouns to captions. This implies that in image captioning, all word categories other than nouns can be evoked by a powerful language model without sacrificing performance on n-gram precision. The pa...

متن کامل

Important New Enhancements to Inclusive Learning Using Recorded Lectures

2012

Mike Wald

This paper explains three new important enhancements to Synote, the freely available, award winning, open source, web based application that makes web hosted recordings easier to access, search, manage, and exploit for learners, teachers and other users. The facility to convert and import narrated PowerPoint PPTX files means that teachers can capture and caption their lectures without requiring...

متن کامل