Actions As Objects: A Novel Action Representation

Authors

  • Alper Yilmaz
  • Mubarak Shah
Abstract

In this paper, we propose to model an action based on both the shape and the motion of the object performing the action. When the object performs an action in 3D, the points on the outer boundary of the object are projected as a 2D (x, y) contour in the image plane. A sequence of such 2D contours with respect to time generates a spatiotemporal volume (STV) in (x, y, t), which can be treated as a 3D object in the (x, y, t) space. We analyze the STV using differential geometric surface properties, such as peaks, pits, valleys and ridges, which are important action descriptors capturing both spatial and temporal properties. A set of motion descriptors for a given action is called an action sketch. The action descriptors are related to various types of motions and object deformations. The first step in our approach is to generate the STV by solving the point correspondence problem between consecutive frames. The correspondences are determined using a two-step graph theoretical approach. After the STV is generated, action descriptors are computed by analyzing the differential geometric properties of the STV. Finally, using these descriptors, we perform action recognition, which is also formulated as a graph theoretical problem. Several experimental results are presented to demonstrate our approach.
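The abstract does not say how the surface types are computed. One common convention, assumed here and not confirmed by the abstract, labels a surface point from the signs of its Gaussian curvature K and mean curvature H. The minimal Python sketch below stacks per-frame contours into (x, y, t) points and applies that sign table. The names `build_stv` and `surface_type`, the tolerance `eps`, and the sign convention (which depends on the chosen normal orientation) are all illustrative assumptions. The sketch also skips the point correspondence step that the paper solves with a graph theoretical approach.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def build_stv(contours: List[List[Point]]) -> List[Tuple[float, float, int]]:
    """Stack per-frame 2D contours into (x, y, t) points.

    Assumption: the frame index is used as the time coordinate t.
    No point correspondence between frames is computed here.
    """
    return [(x, y, t) for t, contour in enumerate(contours) for (x, y) in contour]


def surface_type(K: float, H: float, eps: float = 1e-6) -> str:
    """Label a surface point from the signs of Gaussian (K) and mean (H) curvature.

    Assumption: curvatures with magnitude below eps are treated as zero,
    and negative H corresponds to a peak-like (convex) point.
    """
    k = 0 if abs(K) < eps else (1 if K > 0 else -1)
    h = 0 if abs(H) < eps else (1 if H > 0 else -1)
    table = {
        (1, -1): "peak",
        (1, 1): "pit",
        (0, -1): "ridge",
        (0, 1): "valley",
        (0, 0): "flat",
        (-1, -1): "saddle ridge",
        (-1, 1): "saddle valley",
        (-1, 0): "minimal",
    }
    # K > 0 with H == 0 cannot occur for real principal curvatures.
    return table.get((k, h), "undefined")


if __name__ == "__main__":
    # Two toy frames: a two-point contour, then a one-point contour.
    stv = build_stv([[(0.0, 0.0), (1.0, 0.0)], [(0.0, 1.0)]])
    print(stv)
    print(surface_type(1.0, -1.0))
    print(surface_type(0.0, 0.5))
```

In practice K and H would be estimated from the STV surface itself, and the resulting labels would form the kind of descriptor set the abstract calls an action sketch.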


Similar resources

Action Change Detection in Video Based on HOG

Background and Objectives: Action recognition, the process of labeling an unknown action in a query video, is a challenging problem due to event complexity, variations in imaging conditions, and intra- and inter-individual action variability. A number of solutions have been proposed to solve the action recognition problem. Many of these frameworks suppose that each video sequence includes only one ...


How Outcomes of Actions Influence Infants’ Representation of Those Actions

Three experiments examined how the outcomes of actions influence 10-month-old infants’ (N = 56) representation of those actions and the objects on which they are performed in dynamic, multimodal events. In each experiment, infants were habituated to events in which a colorful novel object was manipulated by a hand. Infants learned that some actions produced outcomes while others did not (Exper...


Generalizing Manipulations using Vision Kernels

In order to perform complex manipulation tasks, a robot must know which actions it can perform with the available objects. In unstructured environments, potential manipulations afforded by objects will not be pre-specified, and must instead be learned. Rather than determining each novel object’s affordances from scratch, the robot can learn more efficiently by generalizing manipulations from si...


Integration and Action in Perception/Action Systems with Access to Non-Local Space Information

Actions and objects are closely tied together (Russell & Norvig 1995). We often think of the actions we can perform on objects as properties of those objects. The theory of action presented here contains the belief that actions are defined in terms of the objects they affect. Intuitively then, actions consist of a verb and a noun, for example, pick-up-the-soda-can, go-to-the-car, or paint-the-...


A Framework for Combined Recognition of Actions and Objects

This paper proposes a novel approach to recognize actions and objects within the context of each other. Assuming that different actions involve different objects in image sequences and that there is a one-to-one relation between object and action type, we present a Bayesian network based framework which combines motion patterns and object usage information to recognize actions/objects. More specifi...


XSAMPL3D: An Action Description Language for the Animation of Virtual Characters

In this paper we present XSAMPL3D, a novel language for the high-level representation of actions performed on objects by (virtual) humans. XSAMPL3D was designed to serve as action representation language in an imitation-based approach to character animation: First, a human demonstrates a sequence of object manipulations in an immersive Virtual Reality (VR) environment. From this demonstration, ...




Publication date: 2005