Hypothesize and Bound: A Computational Focus of Attention Mechanism for Simultaneous 3D Shape Reconstruction, Pose Estimation and Classification from a Single 2D Image

Authors

  • Diego Rother
  • Siddharth Mahendran
  • René Vidal
Abstract

This article presents a mathematical framework to simultaneously tackle the problems of 3D reconstruction, pose estimation and object classification from a single 2D image. In sharp contrast with state-of-the-art methods that rely primarily on 2D information and solve each of these three problems separately or iteratively, we propose a framework that incorporates prior “knowledge” about the 3D shapes of different object classes and solves these problems jointly and simultaneously, using a hypothesize-and-bound (H&B) algorithm [14]. In the proposed H&B algorithm, one hypothesis is defined for each possible pair [object class, object pose], and the algorithm selects the hypothesis H that maximizes a function L(H) encoding how well each hypothesis “explains” the input image. To find this maximum efficiently, the function L(H) is not evaluated exactly for each hypothesis H; instead, upper and lower bounds for it are computed at a much lower cost. In order to obtain bounds for L(H) that are tight yet inexpensive to compute, we extend the theory of shapes described in [14] to handle projections of shapes. This extension allows us to define a probabilistic relationship between the prior knowledge given in 3D and the 2D input image. This relationship is derived from first principles and is proven to be the only relationship having the properties that we intuitively expect from a “projection.” In addition to the efficiency and optimality characteristics of H&B algorithms, the proposed framework has the desirable property of integrating information in the 2D image with information in the 3D prior to estimate the optimal reconstruction. While this article focuses primarily on the problem mentioned above, we believe that the theory presented herein has multiple other potential applications.

Author affiliation: D. Rother, S. Mahendran, R. Vidal, Johns Hopkins University. Tel.: +1-410-516-6736
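The selection loop described in the abstract (bound L(H) cheaply, refine only where needed, stop once one hypothesis provably wins) can be illustrated with a minimal sketch. This is not the authors' implementation: the names `Hypothesis`, `partial_sum_bounds` and `hypothesize_and_bound` are hypothetical, and the toy score, a sum of per-pixel terms assumed to lie in [0, 1], merely stands in for the paper's L(H).

```python
from dataclasses import dataclass
from typing import Iterator, List, Tuple


def partial_sum_bounds(terms: List[float], block: int = 2) -> Iterator[Tuple[float, float]]:
    """Yield progressively tighter (lower, upper) bounds on sum(terms).

    Assumption for this toy example: every term lies in [0, 1], so each
    unevaluated term contributes at least 0 and at most 1.
    """
    n = len(terms)
    done = 0.0
    yield done, float(n)  # nothing evaluated yet
    for k in range(0, n, block):
        done += sum(terms[k:k + block])
        remaining = max(0, n - (k + block))
        yield done, done + remaining


@dataclass
class Hypothesis:
    """Hypothetical container for one [object class, object pose] pair."""
    name: str
    bounds: Iterator[Tuple[float, float]]
    lower: float = float("-inf")
    upper: float = float("inf")
    exhausted: bool = False

    def refine(self) -> None:
        """Spend one more unit of computation to tighten the bounds."""
        try:
            lo, up = next(self.bounds)
        except StopIteration:
            self.exhausted = True
            return
        self.lower = max(self.lower, lo)
        self.upper = min(self.upper, up)


def hypothesize_and_bound(hyps: List[Hypothesis]) -> Hypothesis:
    """Return a hypothesis whose lower bound dominates all rival upper bounds."""
    for h in hyps:
        h.refine()  # initial, cheapest bounds
    while True:
        best = max(hyps, key=lambda h: h.lower)
        if all(best.lower >= h.upper for h in hyps if h is not best):
            return best  # proven maximal without exact evaluation of rivals
        candidates = [h for h in hyps if not h.exhausted]
        if not candidates:
            return best  # bounds cannot be tightened any further
        # Refine the most promising hypothesis (highest upper bound).
        max(candidates, key=lambda h: h.upper).refine()


scores = {
    "A": [0.9, 0.8, 0.9, 0.7],
    "B": [0.1, 0.2, 0.1, 0.3],
    "C": [0.5, 0.5, 0.4, 0.6],
}
hypotheses = [Hypothesis(name, partial_sum_bounds(t)) for name, t in scores.items()]
winner = hypothesize_and_bound(hypotheses)
print(winner.name, winner.lower, winner.upper)
```

In this toy run the winning hypothesis is identified while the two losing hypotheses have only half of their terms evaluated, which is the kind of saving the bounding strategy is meant to provide. Which hypothesis to refine next is a design choice; picking the highest upper bound is one simple option assumed here.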

Similar resources

Simultaneous Monocular 2D Segmentation, 3D Pose Recovery and 3D Reconstruction

We propose a novel framework for joint 2D segmentation and 3D pose and 3D shape recovery, for images coming from a single monocular source. In the past, integration of all three has proven difficult, largely because of the high degree of ambiguity in the 2D-to-3D mapping. Our solution is to learn nonlinear and probabilistic low dimensional latent spaces, using the Gaussian Process Latent Variable ...

A New Approach for Quantitative Evaluation of Reconstruction Algorithms in SPECT

ABSTRACT Background: In nuclear medicine, phantoms are mainly used to evaluate the overall performance of the imaging systems, and practically there is no phantom exclusively designed for the evaluation of the software performance. In this study the Hoffman brain phantom was used for quantitative evaluation of reconstruction techniques. The phantom is modified to acquire t...

Automatic, Effective, and Efficient 3D Face Reconstruction from Arbitrary View Image

In this paper, we propose a fully automatic, effective and efficient framework for 3D face reconstruction based on a single face image in arbitrary view. First, a multi-view face alignment algorithm localizes the face feature points, and then the EM algorithm is applied to derive the optimal 3D shape and position parameters. Moreover, the unit-quaternion-based pose representation is proposed for ef...

Seeing Glassware: from Edge Detection to Pose Estimation and Shape Recovery

Perception of transparent objects has been an open challenge in robotics despite advances in sensors and data-driven learning approaches. In this paper, we introduce a new approach that combines recent advances in learnt object detectors with perceptual grouping in 2D, and projective geometry of apparent contours in 3D. We train a state-of-the-art structured edge detector on an annotated set of ...

Seeing 3D Objects in a Single 2D Image

A general framework simultaneously addressing pose estimation, 2D segmentation, object recognition, and 3D reconstruction from a single image is introduced in this paper. The proposed approach partitions 3D space into voxels and estimates the voxel states that maximize a likelihood integrating two components: the object fidelity, that is, the probability that an object occupies the given voxels...

Journal title:
  • CoRR

Volume abs/1109.5730

Publication date: 2011