Foreground Guidance and Multi-Layer Feature Fusion for Unsupervised Object Discovery with Transformers

10/24/2022
by   Zhiwei Lin, et al.
0

Unsupervised object discovery (UOD) has recently shown encouraging progress with the adoption of pre-trained Transformer features. However, current methods based on Transformers mainly focus on designing the localization head (e.g., seed selection-expansion and normalized cut) and overlook the importance of improving Transformer features. In this work, we handle UOD task from the perspective of feature enhancement and propose FOReground guidance and MUlti-LAyer feature fusion for unsupervised object discovery, dubbed FORMULA. Firstly, we present a foreground guidance strategy with an off-the-shelf UOD detector to highlight the foreground regions on the feature maps and then refine object locations in an iterative fashion. Moreover, to solve the scale variation issues in object detection, we design a multi-layer feature fusion module that aggregates features responding to objects at different scales. The experiments on VOC07, VOC12, and COCO 20k show that the proposed FORMULA achieves new state-of-the-art results on unsupervised object discovery. The code will be released at https://github.com/VDIGPKU/FORMULA.

READ FULL TEXT

page 1

page 3

page 4

page 6

page 7

page 8

research
05/21/2022

Boosting Camouflaged Object Detection with Dual-Task Interactive Transformer

Camouflaged object detection intends to discover the concealed objects h...
research
04/11/2023

MOST: Multiple Object localization with Self-supervised Transformers for object discovery

We tackle the challenging task of unsupervised object localization in th...
research
09/29/2021

Localizing Objects with Self-Supervised Transformers and no Labels

Localizing objects in image collections without supervision can help to ...
research
12/04/2020

F2Net: Learning to Focus on the Foreground for Unsupervised Video Object Segmentation

Although deep learning based methods have achieved great progress in uns...
research
03/25/2023

Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone Images

Object detection on drone images with low-latency is an important but ch...
research
04/05/2022

RBGNet: Ray-based Grouping for 3D Object Detection

As a fundamental problem in computer vision, 3D object detection is expe...
research
03/21/2021

Learning Calibrated-Guidance for Object Detection in Aerial Images

Recently, the study on object detection in aerial images has made tremen...

Please sign up or login with your details

Forgot password? Click here to reset