Robustness of Object Recognition under Extreme Occlusion in Humans and Computational Models

05/11/2019
by Hongru Zhu, et al.

Most objects in the visual world are partially occluded, yet humans recognize them without difficulty. It remains unknown, however, whether object recognition models such as convolutional neural networks (CNNs) can handle real-world occlusion. It is also unclear whether efforts to make these models robust to constant mask occlusion carry over to real-world occlusion. We test both humans and these computational models on a challenging task of object recognition under extreme occlusion, in which target objects are heavily occluded by irrelevant real objects in real backgrounds. Our results show that human vision is very robust to extreme occlusion while CNNs are not, even with modifications designed to handle constant mask occlusion. This implies that the ability to handle constant mask occlusion does not entail robustness to real-world occlusion. As a comparison, we propose another computational model that uses object parts and subparts in a compositional manner to build robustness to occlusion. This model performs significantly better than CNN-based models on our task, with error patterns similar to those of humans. These findings suggest that testing under extreme occlusion can better reveal the robustness of visual recognition, and that the principle of composition can encourage such robustness.
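
The distinction between constant mask occlusion and occlusion by real objects can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names, the list-of-lists image representation, and the simple paste operation are assumptions chosen only to show how the two kinds of occluder differ.

```python
# Illustrative sketch only. Images are 2D lists of grayscale values.
# Function names and the paste-based occluder are hypothetical,
# not taken from the paper.

def occlude_constant(image, top, left, height, width, value=0):
    """Return a copy of `image` with a rectangle filled by one constant value."""
    out = [row[:] for row in image]
    for r in range(top, min(top + height, len(out))):
        for c in range(left, min(left + width, len(out[r]))):
            out[r][c] = value
    return out


def occlude_with_patch(image, patch, top, left):
    """Return a copy of `image` with `patch` (a crop of another object) pasted on top."""
    out = [row[:] for row in image]
    for dr, patch_row in enumerate(patch):
        for dc, pixel in enumerate(patch_row):
            r, c = top + dr, left + dc
            # Ignore the part of the patch that falls outside the image.
            if 0 <= r < len(out) and 0 <= c < len(out[r]):
                out[r][c] = pixel
    return out


if __name__ == "__main__":
    target = [[10] * 4 for _ in range(4)]      # stand-in for a target object
    occluder = [[200, 50], [90, 130]]          # stand-in for a textured real object
    print(occlude_constant(target, 1, 1, 2, 2))
    print(occlude_with_patch(target, occluder, 1, 1))
```

A constant mask removes information without adding structure, whereas a pasted object patch introduces its own edges and textures that a model may mistake for evidence about the target; the abstract reports that robustness to the first does not entail robustness to the second.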


research
06/28/2020

Compositional Convolutional Neural Networks: A Robust and Interpretable Model for Object Recognition under Occlusion

Computer vision systems in real-world applications need to be robust to ...
research
02/25/2021

Blocks World Revisited: The Effect of Self-Occlusion on Classification by Convolutional Neural Networks

Despite the recent successes in computer vision, there remain new avenue...
research
11/23/2016

'Part'ly first among equals: Semantic part-based benchmarking for state-of-the-art object recognition systems

An examination of object recognition challenge leaderboards (ILSVRC, PAS...
research
12/04/2014

Parsing Occluded People by Flexible Compositions

This paper presents an approach to parsing humans when there is signific...
research
04/09/2019

Embodied Visual Recognition

Passive visual systems typically fail to recognize objects in the amodal...
research
01/17/2023

Explain What You See: Open-Ended Segmentation and Recognition of Occluded 3D Objects

Local-HDP (for Local Hierarchical Dirichlet Process) is a hierarchical B...
research
05/22/2017

Learning Robust Object Recognition Using Composed Scenes from Generative Models

Recurrent feedback connections in the mammalian visual system have been ...
