Attention and Scene Understanding
نویسندگان
چکیده
Abstract: This paper presents a simplified, introductory view of how visual atThis paper presents a simplified, introductory view of how visual attention may contribute to and integrate within the broader framework of visual scene understanding. Several key components are identified which cooperate with attention during the analysis of complex dynamic visual inputs, namely rapid computation of scene gist and layout, localized object recognition and tracking at attended locations, working memory that holds a representation of currently relevant targets, and longterm memory of known world entities and their inter-relationships. Evidence from neurobiology and psychophysics is provided to support the proposed architecture.
منابع مشابه
Saliency and Task-Based Eye Movement Prediction and Guidance
The ability to predict and guide viewer attention has important applications in computer graphics, image and scene understanding, object detection, visual search and training. Human eye movements have interested researchers as they provide insight into the cognitive processes involved in task performance. It has also interested researchers to understand what guides viewer attention in a scene. ...
متن کاملVisual Context Driven Semantic Priming of Speech Recognition and Understanding
Fuse is a spoken language understanding system that integrates visual context into early stages of speech recognition. Given a visual scene and a spoken description, the system finds the object in the scene that best fits the meaning of the description. To solve this task, Fuse performs speech recognition and visually-grounded language understanding. Rather than treat these two problems separat...
متن کاملThe Role of Scene Gist and Spatial Dependency among Objects in the Semantic Guidance of Attention
A previous study (Hwang et al., 2011) found evidence for semantic guidance of visual attention during the inspection of real-world scenes, i.e., an influence of semantic relationships among scene objects on overt shifts of attention. In particular, the results revealed an observer bias toward gaze transitions between semantically similar objects. However, these results are not necessarily indic...
متن کاملGuidance of visual attention by semantic information in real-world scenes
Recent research on attentional guidance in real-world scenes has focused on object recognition within the context of a scene. This approach has been valuable for determining some factors that drive the allocation of visual attention and determine visual selection. This article provides a review of experimental work on how different components of context, especially semantic information, affect ...
متن کاملInferring Shared Attention in Social Scene Videos
This paper addresses a new problem of inferring shared attention in third-person social scene videos. Shared attention is a phenomenon that two or more individuals simultaneously look at a common target in social scenes. Perceiving and identifying shared attention in videos plays crucial roles in social activities and social scene understanding. We propose a spatial-temporal neural network to d...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005