ODAM: Gradient-based instance-specific visual explanations for object detection

by Chenyang Zhao, et al.

We propose gradient-weighted Object Detector Activation Maps (ODAM), a visual explanation technique for interpreting the predictions of object detectors. Using the gradients of detector targets flowing into the intermediate feature maps, ODAM produces heat maps that show the influence of image regions on the detector's decision for each predicted attribute. Compared to previous work on class activation maps (CAM), ODAM generates instance-specific explanations rather than class-specific ones. We show that ODAM is applicable to both one-stage and two-stage detectors with different types of detector backbones and heads, and that it produces higher-quality visual explanations than the state of the art, both effectively and efficiently. We next propose a training scheme, Odam-Train, which improves the detector's explanation ability for object discrimination by encouraging consistent explanations for detections of the same object and distinct explanations for detections of different objects. Based on the heat maps produced by ODAM with Odam-Train, we propose Odam-NMS, which uses the model's explanation for each prediction to distinguish duplicate detections. We present a detailed analysis of the visualized explanations of detectors and carry out extensive experiments to validate the effectiveness of the proposed ODAM.
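The difference between an instance-specific, gradient-weighted map and a class-level, pooled-gradient map can be illustrated with a small NumPy sketch. This is a toy illustration under stated assumptions, not the authors' implementation: the abstract only says that gradients of detector targets weight the intermediate feature maps, so the element-wise weighting followed by a channel sum and ReLU is an assumption here, as are the function names `odam_style_heatmap` and `gradcam_style_heatmap`. The gradients are set by hand rather than computed from a real detector.

```python
import numpy as np


def _normalize(heat):
    """Scale a non-negative map to [0, 1]; leave an all-zero map unchanged."""
    peak = heat.max()
    return heat / peak if peak > 0 else heat


def odam_style_heatmap(activations, gradients):
    """Element-wise gradient weighting (assumed form, not confirmed by the source).

    activations, gradients: arrays of shape (C, H, W), where gradients holds
    d(target)/d(activations) for ONE predicted attribute of ONE detection.
    """
    if activations.shape != gradients.shape:
        raise ValueError("activations and gradients must have the same shape")
    # Weight each location by its own gradient, sum over channels, keep positives.
    heat = np.maximum((gradients * activations).sum(axis=0), 0.0)
    return _normalize(heat)


def gradcam_style_heatmap(activations, gradients):
    """Pooled-gradient weighting, shown only for contrast with the map above."""
    # One scalar weight per channel: the spatial mean of the gradient.
    weights = gradients.mean(axis=(1, 2), keepdims=True)
    heat = np.maximum((weights * activations).sum(axis=0), 0.0)
    return _normalize(heat)


# Toy feature map: 2 channels on a 4x4 grid, uniform activations.
activations = np.ones((2, 4, 4))

# Hand-set gradients for two hypothetical detections: detection A depends on
# the top-left region, detection B on the bottom-right region.
grad_a = np.zeros((2, 4, 4))
grad_a[:, :2, :2] = 1.0
grad_b = np.zeros((2, 4, 4))
grad_b[:, 2:, 2:] = 1.0

heat_a = odam_style_heatmap(activations, grad_a)
heat_b = odam_style_heatmap(activations, grad_b)
pooled_a = gradcam_style_heatmap(activations, grad_a)
pooled_b = gradcam_style_heatmap(activations, grad_b)
```

In this toy setting the element-wise maps `heat_a` and `heat_b` highlight different regions, while the pooled maps `pooled_a` and `pooled_b` come out identical because spatial pooling discards where each detection's gradient was located. With a real detector, the gradients would come from automatic differentiation of a chosen target (for example a class score or a box coordinate) with respect to a chosen feature layer.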




Black-box Explanation of Object Detectors via Saliency Maps

We propose D-RISE, a method for generating visual explanations for the p...

Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations

We propose a margin-based loss for vision-language model pretraining tha...

G-CAME: Gaussian-Class Activation Mapping Explainer for Object Detectors

Nowadays, deep neural networks for object detection in images are very p...

Improving Object Detection with Inverted Attention

Improving object detectors against occlusion, blur and noise is a critic...

GAM: Explainable Visual Similarity and Classification via Gradient Activation Maps

We present Gradient Activation Maps (GAM) - a machinery for explaining p...

Crown-CAM: Reliable Visual Explanations for Tree Crown Detection in Aerial Images

Visual explanation of "black-box" models has enabled researchers and exp...

DExT: Detector Explanation Toolkit

State-of-the-art object detectors are treated as black boxes due to thei...
