Generative Prompt Model for Weakly Supervised Object Localization

07/19/2023
by   Yuzhong Zhao, et al.
0

Weakly supervised object localization (WSOL) remains challenging when learning object localization models from image category labels. Conventional methods that discriminatively train activation models ignore representative yet less discriminative object parts. In this study, we propose a generative prompt model (GenPromp), defining the first generative pipeline to localize less discriminative object parts by formulating WSOL as a conditional image denoising procedure. During training, GenPromp converts image category labels to learnable prompt embeddings which are fed to a generative model to conditionally recover the input image with noise and learn representative embeddings. During inference, enPromp combines the representative embeddings with discriminative embeddings (queried from an off-the-shelf vision-language model) for both representative and discriminative capacity. The combined embeddings are finally used to generate multi-scale high-quality attention maps, which facilitate localizing full object extent. Experiments on CUB-200-2011 and ILSVRC show that GenPromp respectively outperforms the best discriminative models by 5.2 for WSOL with the generative model. Code is available at https://github.com/callsys/GenPromp.

READ FULL TEXT

page 2

page 5

page 6

page 7

page 16

page 17

page 18

page 19

research
03/12/2021

Discriminative Region Suppression for Weakly-Supervised Semantic Segmentation

Weakly-supervised semantic segmentation (WSSS) using image-level labels ...
research
04/14/2022

ViTOL: Vision Transformer for Weakly Supervised Object Localization

Weakly supervised object localization (WSOL) aims at predicting object l...
research
08/03/2022

Re-Attention Transformer for Weakly Supervised Object Localization

Weakly supervised object localization is a challenging task which aims t...
research
02/22/2018

Improved Techniques For Weakly-Supervised Object Localization

We propose an improved technique for weakly-supervised object localizati...
research
03/03/2022

Weakly Supervised Object Localization as Domain Adaption

Weakly supervised object localization (WSOL) focuses on localizing objec...
research
01/03/2022

CaFT: Clustering and Filter on Tokens of Transformer for Weakly Supervised Object Localization

Weakly supervised object localization (WSOL) is a challenging task to lo...
research
06/05/2023

Weakly-Supervised Conditional Embedding for Referred Visual Search

This paper presents a new approach to image similarity search in the con...

Please sign up or login with your details

Forgot password? Click here to reset