Unsupervised Object Discovery and Co-Localization by Deep Descriptor Transforming

نویسندگان

  • Xiu-Shen Wei
  • Chen-Lin Zhang
  • Jianxin Wu
  • Chunhua Shen
  • Zhi-Hua Zhou
چکیده

Reusable model design becomes desirable with the rapid expansion of computer vision and machine learning applications. In this paper, we focus on the reusability of pre-trained deep convolutional models. Specifically, different from treating pre-trained models as feature extractors, we reveal more treasures beneath convolutional layers, i.e., the convolutional activations could act as a detector for the common object in the image colocalization problem. We propose a simple yet effective method, termed Deep Descriptor Transforming (DDT), for evaluating the correlations of descriptors and then obtaining the category-consistent regions, which can accurately locate the common object in a set of unlabeled images, i.e., unsupervised object discovery. Empirical studies validate the effectiveness of the proposed DDT method. On benchmark image co-localization datasets, DDT consistently outperforms existing state-of-the-art methods by a large margin. Moreover, DDT also demonstrates good generalization ability for unseen categories and robustness for dealing with noisy data. Beyond those, DDT can be also employed for harvesting web images into valid external data sources for improving performance of both image recognition and object detection. The first two authors contributed equally to this work. This work was done when X.-S. Wei was visiting the University of Adelaide. Corresponding authors: Jianxin Wu and Chunhua Shen Xiu-Shen Wei · Chen-Lin Zhang · Jianxin Wu · Zhi-Hua Zhou Nanjing University, China E-mail: {weixs, zhangcl, wujx, zhouzh}@lamda.nju.edu.cn Chunhua Shen The University of Adelaide, Australia E-mail: [email protected]

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Deep Descriptor Transforming for Image Co-Localization

Reusable model design becomes desirable with the rapid expansion of machine learning applications. In this paper, we focus on the reusability of pre-trained deep convolutional models. Specifically, different from treating pre-trained models as feature extractors, we reveal more treasures beneath convolutional layers, i.e., the convolutional activations could act as a detector for the common obj...

متن کامل

Unsupervised deep object discovery for instance recognition

Severe background clutter is challenging in many computer vision tasks, including large-scale image retrieval. Global descriptors, that are popular due to their memory and search efficiency, are especially prone to corruption by such a clutter. Eliminating the impact of the clutter on the image descriptor increases the chance of retrieving relevant images and prevents topic drift due to actuall...

متن کامل

Object Localization with Boosting and Weak Supervision for Generic Object Recognition

This paper deals, for the first time, with an analysis of localization capabilities of weakly supervised categorization systems. Most existing categorization approaches have been tested on databases, which (a) either show the object(s) of interest in a very prominent way so that their localization can hardly be judged from these experiments, or (b) at least the learning procedure was done with ...

متن کامل

On the Applicability of Unsupervised Feature Learning for Object Recognition in RGB-D Data

We present a feature extraction method for RGB-D data based on k-means clustering that builds on recent work by Coates et al. Using unsupervised learning methods we are able to automatically learn feature responses that combine all available information (color and depth) into one, concise representation. We show that depth information can substantially increase the recognition performance and t...

متن کامل

Laminar Organization of Cerebral Cortex in Transforming Growth Factor Beta Mutant Mice

Transforming growth factor betas (TGF?s) are one of the most widespread and versatile cytokines. The three mammalian TGF? isoforms, ?1, ?2, and ?3, and their receptors regulate proliferation of neuronal precursors as well as survival and differentiation in neurons of developing and adult nervous system. Functions of TGF?s has a wide spectrum ranging from regulating cell proliferation and differ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1707.06397  شماره 

صفحات  -

تاریخ انتشار 2017