Optimizing Relevance Maps of Vision Transformers Improves Robustness

06/02/2022
by   Hila Chefer, et al.
0

It has been observed that visual classification models often rely mostly on the image background, neglecting the foreground, which hurts their robustness to distribution changes. To alleviate this shortcoming, we propose to monitor the model's relevancy signal and manipulate it such that the model is focused on the foreground object. This is done as a finetuning step, involving relatively few samples consisting of pairs of images and their associated foreground masks. Specifically, we encourage the model's relevancy map (i) to assign lower relevance to background regions, (ii) to consider as much information as possible from the foreground, and (iii) we encourage the decisions to have high confidence. When applied to Vision Transformer (ViT) models, a marked improvement in robustness to domain shifts is observed. Moreover, the foreground masks can be obtained automatically, from a self-supervised variant of the ViT model itself; therefore no additional supervision is required.

READ FULL TEXT

page 2

page 4

page 5

page 14

page 20

page 27

page 30

page 33

research
06/02/2023

Evaluating The Robustness of Self-Supervised Representations to Background/Foreground Removal

Despite impressive empirical advances of SSL in solving various tasks, t...
research
01/26/2022

A Comprehensive Study of Image Classification Model Sensitivity to Foregrounds, Backgrounds, and Visual Attributes

While datasets with single-label supervision have propelled rapid advanc...
research
08/22/2022

FurryGAN: High Quality Foreground-aware Image Synthesis

Foreground-aware image synthesis aims to generate images as well as thei...
research
11/18/2022

Invariant Learning via Diffusion Dreamed Distribution Shifts

Though the background is an important signal for image classification, o...
research
10/07/2021

Virtual Multi-Modality Self-Supervised Foreground Matting for Human-Object Interaction

Most existing human matting algorithms tried to separate pure human-only...
research
03/04/2020

Foreground model recognition through Neural Networks for CMB B-mode observations

In this work we present a Neural Network (NN) algorithm for the identifi...
research
11/21/2020

Contextual Interference Reduction by Selective Fine-Tuning of Neural Networks

Feature disentanglement of the foreground target objects and the backgro...

Please sign up or login with your details

Forgot password? Click here to reset