MVDet: multi-view multi-class object detection without ground plane assumption

نویسندگان

چکیده

Abstract Although many state-of-the-art methods of object detection in a single image have achieved great success the last few years, they still suffer from false positives crowd scenes real-world applications like automatic checkout. In order to address limitations single-view complex scenes, we propose MVDet, an end-to-end learnable approach that can detect and re-identify multi-class objects multiple images captured by cameras (multi-view). Our is based on premise incorrect results specific view be eliminated using precise cues other views, given availability multi-view images. Unlike most existing algorithms, which assume belong class ground plane, our classify without such assumptions thus more practical. To objects, integrated architecture for region proposal, re-identification, classification. Additionally, utilize epipolar geometry constraint devise novel re-identification algorithm does not require about plane assumption. model demonstrates competitive performance compared several baselines challenging MessyTable dataset.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Models for multi-view object class detection

Learning how to detect objects from many classes in a wide variety of viewpoints is a key goal of computer vision. Existing approaches, however, require excessive amounts of training data. Implementors need to collect numerous training images not only to cover changes in the same object’s shape due to the viewpoint variation, but also to accommodate the variability in appearance among instances...

متن کامل

Multi-View Object Detection by Classifier Interpolation

In this paper, we propose a novel solution for multi-view object detection. Given a set of training examples at different views, we select examples at a few key views and train one classifier for each of them. Then classifiers for more intermediate views can be interpolated from key views. The interpolation is conducted on the weights and positions of features, under the assumption that they ca...

متن کامل

Multi-View Background Subtraction for Object Detection

Consider a popular tourist destination shown in Figure 1. How can we exploit the large set of photographs available online depicting this same general location in order to better understand the content of this particular image? It is useful to divide scene components into two categories: dynamic objects such as people, bikes, cars, pigeons or street vendors that move about and are likely to onl...

متن کامل

Multi-class Multi-object Tracking Using Changing Point Detection

This paper presents a robust multi-class multi-object tracking (MCMOT) formulated by a Bayesian filtering framework. Multiobject tracking for unlimited object classes is conducted by combining detection responses and changing point detection (CPD) algorithm. The CPD model is used to observe abrupt or abnormal changes due to a drift and an occlusion based spatiotemporal characteristics of track ...

متن کامل

Iterative Multi-View Plane Fitting

We present a method for the reconstruction of 3D planes from calibrated 2D images. Given a set of pixelsΩ in a reference image, our method computes a plane which best approximates that part of the scene which has been projected to Ω by exploiting additional views. Based on classical image alignment techniques we derive linear matching equations minimally parameterized by the three parameters of...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Pattern Analysis and Applications

سال: 2023

ISSN: ['1433-755X', '1433-7541']

DOI: https://doi.org/10.1007/s10044-023-01168-6