captioning order

نتایج جستجو برای: captioning order

تعداد نتایج: 908879 فیلتر نتایج به سال:

Object Captioning and Retrieval with Natural Language

2018

Anh Nguyen Thanh-Toan Do Ian Reid Darwin G. Caldwell Nikos G. Tsagarakis

We address the problem of jointly learning vision and language to understand the object in a fine-grained manner. The key idea of our approach is the use of object descriptions to provide the detailed understanding of an object. Based on this idea, we propose two new architectures to solve two related problems: object captioning and natural language-based object retrieval. The goal of the objec...

متن کامل

Emotive Captioning and Access to Television

2005

Deborah I. Fels Daniel G. Lee Carmen Branje Matthew Hornburg

Closed captioning has been enabling access to television for people who are deaf and hard of hearing since the early 1970s. Since that time, technology and people’s demands have been steadily improving and increasing. Closed captioning has not kept up with these changes. We present the results of a study that used graphics, colour, icons and animation as well as text, emotive captions, to captu...

متن کامل

Adaptive Feature Abstraction for Translating Video to Text

2016

Yunchen Pu Martin Renqiang Min Zhe Gan Lawrence Carin

Previous models for video captioning often use the output from a specific layer of a Convolutional Neural Network (CNN) as video features. However, the variable contextdependent semantics in the video may make it more appropriate to adaptively select features from the multiple CNN layers. We propose a new approach for generating adaptive spatiotemporal representations of videos for the captioni...

متن کامل

Effectiveness of captioned videos for incidental vocabulary learning and retention: the role of working memory

Journal: :Computer Assisted Language Learning 2023

Working memory (WM) may be an essential component of incidental vocabulary learning and retention from captioned videos. However, how WM affects young learners’ under different types captions remains unclear. The present study employs a between-subject research design. main purpose is to examine two WM— phonological short-term complex WM—impact outcomes incidentally learned retained three capti...

متن کامل

Arabic Image Captioning: The Effect of Text Pre-processing on the Attention Weights and the BLEU-N Scores

Journal: :International Journal of Advanced Computer Science and Applications 2022

Image captioning using deep neural networks has recently gained increasing attention, mostly for English langue, with only few studies in other languages. Good image model is required to automatically generate sensible, syntactically and semantically correct captions, which turn requires good models both computer vision natural language processing. The process more challenging case of data scar...

متن کامل

Image Caption Generator in Hindi Using Attention

Journal: :Advances in transdisciplinary engineering 2022

Image Captioning has gained tremendous spotlight in recent years. However, the captioning models generate captions English language. In this paper, we present an image caption generator for our regional language that is Hindi using Resnet50 and LSTM with attention module. An experimental study shown highlighting effect of attention-based learning on generated captions. Flickr8k dataset used to ...

متن کامل

Grained Classification and Captioning Tasks

2018

Raghav Goyal Farzaneh Mahdisoltani Guillaume Berger Waseem Gharbieh Ingo Bax Roland Memisevic

Understanding concepts in the world remains one of the well-sought endeavours of ML. Whereas ImageNet enabled success in object recognition and various related tasks via transfer learning, the ability to understand physical concepts prevalent in the world still remains an unattained, yet desirable, goal. Video as a vision modality encodes how objects change across time with respect to pose, pos...

متن کامل

Learning to Evaluate Image Captioning

2018

Yin Cui Guandao Yang Andreas Veit Xun Huang Serge Belongie

Evaluation metrics for image captioning face two challenges. Firstly, commonly used metrics such as CIDEr, METEOR, ROUGE and BLEU often do not correlate well with human judgments. Secondly, each metric has well known blind spots to pathological caption constructions, and rulebased metrics lack provisions to repair such blind spots once identified. For example, the newly proposed SPICE correlate...

متن کامل

Video Captioning with Multi-Faceted Attention

Journal: :Transactions of the Association for Computational Linguistics 2018

متن کامل

Multimodal Feature Learning for Video Captioning

Journal: :Mathematical Problems in Engineering 2018

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید