Human Daily Action Analysis with Multi-view and Color-Depth Data

نویسندگان

  • Zhongwei Cheng
  • Lei Qin
  • Yituo Ye
  • Qingming Huang
  • Qi Tian
چکیده

Improving human action recognition in videos is restricted by the inherent limitations of the visual data. In this paper, we take the depth information into consideration and construct a novel dataset of human daily actions. The proposed ACT4 dataset provides synchronized data from 4 views and 2 sources, aiming to facilitate the research of action analysis across multiple views and multiple sources. We also propose a new descriptor of depth information for action representation, which depicts the structural relations of spatiotemporal points within action volume using the distance information in depth data. In experimental validation, our descriptor obtains superior performance to the state-of-the-art action descriptors designed for color information, and more robust to viewpoint variations. The fusion of features from different sources is also discussed, and a simple but efficient method is presented to provide a baseline performance on the proposed dataset.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-depth Camera System for 3d Video Generation

In this paper, we describe a multi-depth camera system for 3D video contents generation. We combine five video cameras and five TOF depth cameras to capture a scene in real time. By taking advantages of both active and passive sensor based depth acquisition methods, we can estimate multi-view depth sequences accurately. After performing several steps of preprocessing, the depth sequences are wa...

متن کامل

Weighted Fusion of Depth and Inertial Data to Improve View Invariance for Human Action Recognition

This paper presents an extension to our previously developed fusion framework [10] involving a depth camera and an inertial sensor in order to improve its view invariance aspect for human action recognition applications. A computationally efficient view estimation based on skeleton joints is considered in order to select the most relevant depth training data when recognizing test samples. Two c...

متن کامل

Depth Based View Synthesis Using Graph Cuts for 3DTV

In three-dimensional television (3DTV), an interactive free viewpoint selection application has received more attention so far. This paper presents a novel method that synthesizes a free-viewpoint based on multiple textures and depth maps in multi-view camera configuration. This method solves the cracks and holes problem due to sampling rate by performing an inverse warping to retrieve texture ...

متن کامل

Depth Improvement for FTV Systems Based on the Gradual Omission of Outliers

Virtual view synthesis is an essential part of computer vision and 3D applications. A high-quality depth map is the main problem with virtual view synthesis. Because as compared to the color image the resolution of the corresponding depth image is low. In this paper, an efficient and confided method based on the gradual omission of outliers is proposed to compute reliable depth values. In the p...

متن کامل

A study of depth/texture bit-rate allocation in multi-view video plus depth compression

Multi-view Video plus Depth (MVD) data offer a reliable representation of three dimensional (3D) scenes for 3D Video applications. This is a huge amount of data whose compression is an important challenge for researchers at the current time. Consisting of texture and depth video sequences, the question of the relationship between these two types of data regarding bitrate allocation often raises...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012