Learning Robust Object Recognition Using Composed Scenes from Generative Models

05/22/2017
by   Hao Wang, et al.
0

Recurrent feedback connections in the mammalian visual system have been hypothesized to play a role in synthesizing input in the theoretical framework of analysis by synthesis. The comparison of internally synthesized representation with that of the input provides a validation mechanism during perceptual inference and learning. Inspired by these ideas, we proposed that the synthesis machinery can compose new, unobserved images by imagination to train the network itself so as to increase the robustness of the system in novel scenarios. As a proof of concept, we investigated whether images composed by imagination could help an object recognition system to deal with occlusion, which is challenging for the current state-of-the-art deep convolutional neural networks. We fine-tuned a network on images containing objects in various occlusion scenarios, that are imagined or self-generated through a deep generator network. Trained on imagined occluded scenarios under the object persistence constraint, our network discovered more subtle and localized image features that were neglected by the original network for object classification, obtaining better separability of different object classes in the feature space. This leads to significant improvement of object recognition under occlusion for our network relative to the original network trained only on un-occluded images. In addition to providing practical benefits in object recognition under occlusion, this work demonstrates the use of self-generated composition of visual scenes through the synthesis loop, combined with the object persistence constraint, can provide opportunities for neural networks to discover new relevant patterns in the data, and become more flexible in dealing with novel situations.

READ FULL TEXT

page 3

page 4

page 7

research
04/21/2021

Recurrent Feedback Improves Recognition of Partially Occluded Objects

Recurrent connectivity in the visual cortex is believed to aid object re...
research
09/12/2019

Recurrent Connectivity Aids Recognition of Partly Occluded Objects

Feedforward convolutional neural networks are the prevalent model of cor...
research
05/11/2019

Robustness of Object Recognition under Extreme Occlusion in Humans and Computational Models

Most objects in the visual world are partially occluded, but humans can ...
research
11/02/2020

Deep Feature Augmentation for Occluded Image Classification

Due to the difficulty in acquiring massive task-specific occluded images...
research
09/09/2019

TDAPNet: Prototype Network with Recurrent Top-Down Attention for Robust Object Classification under Partial Occlusion

Despite deep convolutional neural networks' great success in object clas...
research
05/05/2023

Persistent Homology Meets Object Unity: Object Recognition in Clutter

Recognition of occluded objects in unseen and unstructured indoor enviro...
research
09/27/2022

Reconstruction-guided attention improves the robustness and shape processing of neural networks

Many visual phenomena suggest that humans use top-down generative or rec...

Please sign up or login with your details

Forgot password? Click here to reset