Transformer-based Multi-Instance Learning for Weakly Supervised Object Detection

03/27/2023
by Zhaofei Wang, et al.

Weakly Supervised Object Detection (WSOD) enables the training of object detection models using only image-level annotations. State-of-the-art WSOD detectors commonly use multi-instance learning (MIL) as their backbone and assume that the bounding box proposals of an image are independent of each other. However, because such approaches use only the highest-scoring proposal and discard potentially useful information from the other proposals, their independent MIL backbone often limits models to the salient parts of an object or causes them to detect only one object per class. To solve these problems, we propose a novel backbone for WSOD based on our tailored Vision Transformer, named Weakly Supervised Transformer Detection Network (WSTDN). Our algorithm is the first to demonstrate that self-attention modules that consider inter-instance relationships are effective backbones for WSOD. We also introduce a novel bounding box mining (BBM) method integrated with a memory transfer refinement (MTR) procedure that uses instance dependencies to facilitate instance refinement. Experimental results on the PASCAL VOC2007 and VOC2012 benchmarks demonstrate the effectiveness of our proposed WSTDN and modified instance refinement modules.
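
To make the contrast in the abstract concrete, the following is a minimal NumPy sketch, not the authors' WSTDN implementation. It assumes a two-stream MIL head of the kind commonly used in WSOD (softmax over classes multiplied by softmax over proposals), in which every proposal is scored independently, and then shows how a single self-attention layer could let proposals exchange information before scoring. All names, shapes, and random weights here (`mil_image_scores`, `self_attention`, `P`, `D`, `C`) are hypothetical and chosen only for illustration.

```python
import numpy as np

def softmax(x, axis):
    # Numerically stable softmax along the given axis.
    z = x - x.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def mil_image_scores(feats, w_cls, w_det):
    """Assumed two-stream MIL head: each proposal is scored on its own features."""
    cls = softmax(feats @ w_cls, axis=1)  # per proposal: distribution over classes
    det = softmax(feats @ w_det, axis=0)  # per class: distribution over proposals
    proposal_scores = cls * det           # shape (num_proposals, num_classes)
    # Summing over proposals gives image-level scores, trainable with image labels.
    return proposal_scores, proposal_scores.sum(axis=0)

def self_attention(feats, w_q, w_k, w_v):
    """Single-head self-attention: every proposal attends to all proposals."""
    q, k, v = feats @ w_q, feats @ w_k, feats @ w_v
    attn = softmax(q @ k.T / np.sqrt(k.shape[1]), axis=1)  # (P, P), rows sum to 1
    return feats + attn @ v  # residual connection keeps the original features

# Hypothetical sizes: 6 proposals, 8-dim features, 3 classes.
rng = np.random.default_rng(0)
P, D, C = 6, 8, 3
feats = rng.normal(size=(P, D))
w_cls, w_det = rng.normal(size=(D, C)), rng.normal(size=(D, C))
w_q, w_k, w_v = (rng.normal(size=(D, D)) for _ in range(3))

# Independent MIL: proposals never see each other.
_, independent_scores = mil_image_scores(feats, w_cls, w_det)

# Attention first: each proposal's features now depend on the other proposals.
mixed = self_attention(feats, w_q, w_k, w_v)
proposal_scores, image_scores = mil_image_scores(mixed, w_cls, w_det)
```

In this sketch the only difference between the two paths is whether proposal features are mixed by attention before the MIL head; the BBM and MTR refinement steps named in the abstract are not modeled.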

research
07/11/2022

Scaling Novel Object Detection with Weakly Supervised Detection Transformers

Weakly supervised object detection (WSOD) enables object detectors to be...
research
04/20/2021

Transformer Transforms Salient Object Detection and Camouflaged Object Detection

The transformer networks, which originate from machine translation, are ...
research
09/11/2023

Gall Bladder Cancer Detection from US Images with Only Image Level Labels

Automated detection of Gallbladder Cancer (GBC) from Ultrasound (US) ima...
research
11/27/2019

Towards Precise End-to-end Weakly Supervised Object Detection Network

It is challenging for weakly supervised object detection network to prec...
research
07/09/2018

PCL: Proposal Cluster Learning for Weakly Supervised Object Detection

Weakly Supervised Object Detection (WSOD), using only image-level annota...
research
04/09/2020

Instance-aware, Context-focused, and Memory-efficient Weakly Supervised Object Detection

Weakly supervised learning has emerged as a compelling tool for object d...
research
10/10/2015

Spatial Semantic Regularisation for Large Scale Object Detection

Large scale object detection with thousands of classes introduces the pr...
