Video Question-Answering Techniques, Benchmark Datasets and Evaluation Metrics Leveraging Video Captioning: A Comprehensive Survey
نویسندگان
چکیده
While describing visual data is a trivial task for humans, it an intricate computer. This even more challenging if the video. Comprehending video and called Video Captioning. involves understanding semantics of then generating human-like descriptions It requires collaboration both research communities computer vision natural language processing. The captions generated by captioning can be further utilized retrieval, summarization, question-answering, etc. Question-Answering (video-QA) querying system to obtain answer in response. paper presents brief survey techniques comprehensive review existing techniques, datasets, evaluation metrics video-QA. Video-QA rely on attention mechanism generate relevant results. presented shows that recent works Memory Networks, Generative Adversarial Reinforced Decoders, have capability handle complexities challenges Additionally, graph-based methods, although less explored, give very promising In this article, we discussed emerging directions various application areas
منابع مشابه
Leveraging Video Descriptions to Learn Video Question Answering
We propose a scalable approach to learn video-based question answering (QA): to answer a free-form natural language question about the contents of a video. Our approach automatically harvests a large number of videos and descriptions freely available online. Then, a large number of candidate QA pairs are automatically generated from descriptions rather than manually annotated. Next, we use thes...
متن کاملSurvey of Visual Question Answering: Datasets and Techniques
Visual question answering (or VQA) is a new and exciting problem that combines natural language processing and computer vision techniques. We present a survey of the various datasets and models that have been used to tackle this task. The first part of this survey details the various datasets for VQA and compares them along some common factors. The second part of this survey details the differe...
متن کاملVideo Compression Techniques – A Comprehensive Survey
In modern world more image and video compression technologies improving gradually. But major barrier of the new technologies concepts are repeated from authors. Because authors can’t find in depth of the papers from various technologies. So Survey of Video Compression Techniques very helpful for these types problems in video compression areas. Video compression techniques such as DCT coding, Qu...
متن کاملEffective Question Answering Techniques and their Evaluation Metrics
Question Answering (QA) is a focused way of information retrieval. Question Answering system tries to get back the accurate answers to questions posed in natural language provided a set of documents. Basically question answering system (QA) has three elements i. e. question classification, information retrieval (IR), and answer extraction. These elements play a major role in Question Answering....
متن کاملQuestion Answering Evaluation Survey
Evaluating Question Answering (QA) Systems is a very complex task: state-of-the-art systems involve processing whose influences and contributions on the final result are not clear and need to be studied. We present some key points on different aspects of the QA Systems (QAS) evaluation: mainly, as performed during large-scale campaigns, but also with clues on the evaluation of QAS typical softw...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2021
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2021.3058248