DeepVoting: An Explainable Framework for Semantic Part Detection under Partial Occlusion

09/14/2017
by   Zhishuai Zhang, et al.
0

In this paper, we study the task of detecting semantic parts of an object. This is very important in computer vision, as it provides the possibility to parse an object as human do, and helps us better understand object detection algorithms. Also, detecting semantic parts is very challenging especially when the parts are partially or fully occluded. In this scenario, the popular proposal-based methods like Faster-RCNN often produce unsatisfactory results, because both the proposal extraction and classification stages may be confused by the irrelevant occluders. To this end, we propose a novel detection framework, named DeepVoting, which accumulates local visual cues, called visual concepts (VC), to locate the semantic parts. Our approach involves adding two layers after the intermediate outputs of a deep neural network. The first layer is used to extract VC responses, and the second layer performs a voting mechanism to capture the spatial relationship between VC's and semantic parts. The benefit is that each semantic part is supported by multiple VC's. Even if some of the supporting VC's are missing due to occlusion, we can still infer the presence of the target semantic part using the remaining ones. To avoid generating an exponentially large training set to cover all occlusion cases, we train our model without seeing occlusion and transfer the learned knowledge to deal with occlusions. This setting favors learning the models which are naturally robust and adaptive to occlusions instead of over-fitting the occlusion patterns in the training data. In experiments, DeepVoting shows significantly better performance on semantic part detection in occlusion scenarios, compared with Faster-RCNN, with one order of magnitude fewer parameters and 2.5x testing speed. In addition, DeepVoting is explainable as the detection result can be diagnosed via looking up the voted VC's.

READ FULL TEXT

page 2

page 3

page 5

page 6

page 7

research
07/25/2017

Detecting Semantic Parts on Partially Occluded Objects

In this paper, we address the task of detecting semantic parts on partia...
research
06/28/2020

Compositional Convolutional Neural Networks: A Robust and Interpretable Model for Object Recognition under Occlusion

Computer vision systems in real-world applications need to be robust to ...
research
11/13/2017

Visual Concepts and Compositional Voting

It is very attractive to formulate vision in terms of pattern theory Mum...
research
09/09/2019

TDAPNet: Prototype Network with Recurrent Top-Down Attention for Robust Object Classification under Partial Occlusion

Despite deep convolutional neural networks' great success in object clas...
research
06/12/2022

Object Occlusion of Adding New Categories in Objection Detection

Building instance detection models that are data efficient and can handl...
research
04/18/2020

Occluded Prohibited Items Detection: An X-ray Security Inspection Benchmark and De-occlusion Attention Module

Object detection has taken advantage of the advances in deep convolution...
research
11/28/2018

Semantic Part Detection via Matching: Learning to Generalize to Novel Viewpoints from Limited Training Data

Detecting semantic parts of an object is a challenging task in computer ...

Please sign up or login with your details

Forgot password? Click here to reset