Occlusion robust pose and model estimation using Gaussian Process Latent Variable Model on GPU
نویسنده
چکیده
In this project, we would like to develop multi-object pose and model estimation which uses the 3D voxelized model and foreground background segmentation to detect 3D pose and model variation. In the relevant literature, [4], the authors used gradient descent to minimize the optimization energy function which is the integration over image multiplied by the smoothed step function of SDF of 3D model (silhouette) to jointly optimize the 6 degree location and rotations and deformation of model. The system is generally stable given prior mask and pose and works faster than state-of-the-art 3D pose estimation algorithm such as Zia and Stark [7]. However, the system requires prior mask which is hard to get automatically. In this project, we propose automatic segmentation mechanism that make use of object relation and alternating multi-object newton step. Thus, we improve the system by making it robust to texture and background color and especially occlusion from other object. The pipeline is first, we use [2] object detector to find location of objects and use these bounding boxes for contour detection techniques such as hierarchical image segmentation [8] or Grab Cut to capture the contour of objects which is essential for correct pose estimation. Then we run newton step for pose and model estimation for multiple objects.
منابع مشابه
Gaussian Process Latent Variable Models for Human Pose Estimation
We describe a generative approach to recover 3D human pose from image silhouettes. Our method is based on learning a shared low dimensional latent representation capable of generating both human pose and image observations through the GP-LVM [Law05] We learn a dynamical model over the latent space which allows us to disambiguate between ambiguous silhouettes by temporal consistency. The model h...
متن کاملModeling Human Locomotion with Topologically Constrained Latent Variable Models
Learned, activity-specific motion models are useful for human pose and motion estimation. Nevertheless, while the use of activityspecific models simplifies monocular tracking, it leaves open the larger issues of how one learns models for multiple activities or stylistic variations, and how such models can be combined with natural transitions between activities. This paper extends the Gaussian p...
متن کاملSimultaneous Monocular 2D Segmentation, 3D Pose Recovery and 3D Reconstruction
We propose a novel framework for joint 2D segmentation and 3D pose and 3D shape recovery, for images coming from a single monocular source. In the past, integration of all three has proven difficult, largely because of the high degree of ambiguity in the 2D 3D mapping. Our solution is to learn nonlinear and probabilistic low dimensional latent spaces, using the Gaussian Process Latent Variable ...
متن کاملBacking Off: Hierarchical Decomposition of Activity for 3D Novel Pose Recovery
For model-based 3D human pose estimation, even simple models of the human body lead to high-dimensional state spaces. Where the class of activity is known a priori, lowdimensional activity models learned from training data make possible a thorough and efficient search for the best pose. Conversely, searching for solutions in the full state space places no restriction on the class of motion to b...
متن کاملLatent Gaussian Mixture Regression for Human Pose Estimation
Discriminative approaches for human pose estimation model the functional mapping, or conditional distribution, between image features and 3D pose. Learning such multi-modal models in high dimensional spaces, however, is challenging with limited training data; often resulting in over-fitting and poor generalization. To address these issues latent variable models (LVMs) have been introduced. Shar...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013