Viewnet Architectures for Invariant 3-d Object Learning and Recognition from Multiple 2-d Views

نویسندگان

  • Stephen Grossberg
  • Gary Bradski
چکیده

3 ABSTRACT 3 The recognition of 3-D objects from sequences of their 2-D views is modeled by a family of self-organizing neural architectures, called VIEWNET, that use View Information Encoded With NETworks. VIEWNET incorporates a preprocessor that generates a compressed but 2-D invariant representation of an image, a supervised incremental learning system (Fuzzy ARTMAP) that classiies the preprocessed representations into 2-D view categories whose outputs are combined into 3-D invariant object categories, and a working memory that makes a 3-D object prediction by accumulating evidence over time from 3-D object category nodes as multiple 2-D views are experienced. VIEWNET was benchmarked on an MIT Lincoln Laboratory database of 128x128 2-D views of aircraft, including small frontal views, with and without additive noise. A recognition rate of up to 90% is achieved with one 2-D view and of up to 98.5% correct with three 2-D views. The properties of 2-D view and 3-D object category nodes are compared with those of cells in monkey inferotemporal cortex.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fast Learning VIEWNET Architectures for Recognizing 3-D Objects from Multiple 2-D Views

The recognition of 3-D objects from sequences of their 2-D views is modeled by a family of self-organizing neural architectures, called VIEWNET, that use View Information Encoded With NETworks. VIEWNET incorporates a preprocessor that generates a compressed but 2-D invariant representation of an image, a supervised incremental learning system that classifies the preprocessed representations int...

متن کامل

Fast-learning VIEWNET architectures for recognizing three-dimensional objects from multiple two-dimensional views

-The recognition o f three-dimensional ( 3-D ) objects from sequences o f their two-dimensional ( 2-D ) views is modeled by a family o f self-organizing neural architectures, called VIEWNET, that use View Information Encoded With NETworks. V IEWNET incorporates a preprocessor that generates a compressed but 2-D invariant representation o f an image, a supervised incremental learning system that...

متن کامل

Spatiotemporal information during unsupervised learning enhances viewpoint invariant object recognition.

Recognizing objects is difficult because it requires both linking views of an object that can be different and distinguishing objects with similar appearance. Interestingly, people can learn to recognize objects across views in an unsupervised way, without feedback, just from the natural viewing statistics. However, there is intense debate regarding what information during unsupervised learning...

متن کامل

Working Memory Networks for Learning Temporal Order, with Application to 3-D Visual Object Recognition

Working memory neural networks are characterized which encode the invariant temporal order of sequential events. Inputs to the networks, called Sustained Temporal Order REcurrent (STORE) models, may be presented at widely differing speeds, durations, and interstimulus intervals. The STORE temporal order code is designed to enable all emergent groupings of sequential events to be stably learned ...

متن کامل

How neurons learn to associate 2 D - views in invariant object recognition

A local learning rule is shown to be able to account for the association of images together on the basis of temporal order rather than spatial con guration, as described in single cell recording results published by Miyashita (1988). Possible reasons for requiring such learning are then given in the context of invariant object recognition This work was supported by a Fellowship from the Max-Pla...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007