Robust Object Recognition Under Partial Occlusions Using an RGB-D Camera
نویسندگان
چکیده
For a robot to execute a specific task, the robot firstly has to recognize what objects are in robot’s view. To complete a specific task in a given time, the computation time for recognition is also important. There are much research for increasing recognition accuracy, but the recognition speed is not enough to be applied in real environment. On the other hand, there are also much research for reducing the computation time for recognition, but the recognition accuracy needs to be further improved. Nowadays, deep network has come into the spotlight due to its speed and accuracy. Deep network doesn’t need to find hand-tuned features. This paper proseses a deep network-based object recognition algorithm. The main contribution is that objects could be recognized under occlusion, as objects are often laid to overlap each other. The occlusion makes object recognition accuracy worse. To overcome this problem, the dataset for training consists of not full images but partial information of images and corresponding ground truths. The object region could be found very quickly by using an RGB-D camera. By assuming that most objects are on the stable plane, object regions are taken easily. Experimental results demonstrate such consideration of contextual information (e.g. objects are on the table) makes the performance of recognition better.
منابع مشابه
Hyper Frame Vision: A Real-Time Vision System for 6-DOF Object Localization
A new system for robot vision is proposed that integrates a 3-D object recognition task and a 3-D object tracking task, enabling real-time 6-DOF localization of a known continuously moving object. A computational time-lag between the two tasks is absorbed by a large amount of frame memory. For 3-D sensing, calibrated trinocular stereo cameras are used to employ stereo-vision-based object recogn...
متن کاملA probabilistic integrated object recognition and tracking framework
This paper describes a probabilistic integrated object recognition and tracking framework called PIORT, together with two specific methods derived from it, which are evaluated experimentally in several test video sequences. The first step in the proposed framework is a static recognition module that provides class probabilities for each pixel of the image from a set of local features. These pro...
متن کاملMulti-modal user identification and object recognition surveillance system
We propose an automatic surveillance system for user identification and object recognition based on multi-modal RGB-Depth data analysis. We model a RGBD environment learning a pixel-based background Gaussian distribution. Then, user and object candidate regions are detected and recognized using robust statistical approaches. The system robustly recognizes users and updates the system in an onli...
متن کاملForeground Detection on Depth Maps Using Skeletal Representation of Object Silhouettes
This article considers the problem of foreground detection on depth maps. The problem of finding objects of interest on images appears in many object detection, recognition and tracking applications as one of the first steps. However, this problem becomes too complicated for RGB images with multicolored or constantly changing background and in presence of occlusions. Depth maps provide valuable...
متن کاملAn RGB-D based image set classification for robust face recognition from Kinect data
The paper proposes a method for robust face recognition from low quality Kinect acquired images which have a wide range of variations in head pose, illumination, facial expressions, sunglass disguise and occlusions by hand. Multiple Kinect images of a person are considered as an image set and face recognition from these images is formulated as an RGB-D image set classification problem. The Kine...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014