A Generic Visualization Approach for Convolutional Neural Networks

07/19/2020
by   Ahmed Taha, et al.
59

Retrieval networks are essential for searching and indexing. Compared to classification networks, attention visualization for retrieval networks is hardly studied. We formulate attention visualization as a constrained optimization problem. We leverage the unit L2-Norm constraint as an attention filter (L2-CAF) to localize attention in both classification and retrieval networks. Unlike recent literature, our approach requires neither architectural changes nor fine-tuning. Thus, a pre-trained network's performance is never undermined L2-CAF is quantitatively evaluated using weakly supervised object localization. State-of-the-art results are achieved on classification networks. For retrieval networks, significant improvement margins are achieved over a Grad-CAM baseline. Qualitative evaluation demonstrates how the L2-CAF visualizes attention per frame for a recurrent retrieval network. Further ablation studies highlight the computational cost of our approach and compare L2-CAF with other feasible alternatives. Code available at https://bit.ly/3iDBLFv

READ FULL TEXT

page 10

page 12

page 13

page 22

research
04/14/2022

ViTOL: Vision Transformer for Weakly Supervised Object Localization

Weakly supervised object localization (WSOL) aims at predicting object l...
research
11/23/2022

VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval

Many recent studies leverage the pre-trained CLIP for text-video cross-m...
research
02/04/2016

Self-Transfer Learning for Fully Weakly Supervised Object Localization

Recent advances of deep learning have achieved remarkable performances i...
research
08/24/2022

DPTDR: Deep Prompt Tuning for Dense Passage Retrieval

Deep prompt tuning (DPT) has gained great success in most natural langua...
research
10/07/2020

Channel Recurrent Attention Networks for Video Pedestrian Retrieval

Full attention, which generates an attention value per element of the in...
research
03/04/2021

SVMax: A Feature Embedding Regularizer

A neural network regularizer (e.g., weight decay) boosts performance by ...
research
08/05/2019

A Weakly-Supervised Attention-based Visualization Tool for Assessing Political Affiliation

In this work, we seek to finetune a weakly-supervised expert-guided Deep...

Please sign up or login with your details

Forgot password? Click here to reset