Exploring Transformers for Open-world Instance Segmentation

08/08/2023
by   Jiannan Wu, et al.
0

Open-world instance segmentation is a rising task, which aims to segment all objects in the image by learning from a limited number of base-category objects. This task is challenging, as the number of unseen categories could be hundreds of times larger than that of seen categories. Recently, the DETR-like models have been extensively studied in the closed world while stay unexplored in the open world. In this paper, we utilize the Transformer for open-world instance segmentation and present SWORD. Firstly, we introduce to attach the stop-gradient operation before classification head and further add IoU heads for discovering novel objects. We demonstrate that a simple stop-gradient operation not only prevents the novel objects from being suppressed as background, but also allows the network to enjoy the merit of heuristic label assignment. Secondly, we propose a novel contrastive learning framework to enlarge the representations between objects and background. Specifically, we maintain a universal object queue to obtain the object center, and dynamically select positive and negative samples from the object queries for contrastive learning. While the previous works only focus on pursuing average recall and neglect average precision, we show the prominence of SWORD by giving consideration to both criteria. Our models achieve state-of-the-art performance in various open-world cross-category and cross-dataset generalizations. Particularly, in VOC to non-VOC setup, our method sets new state-of-the-art results of 40.0 SWORD significantly outperforms the previous best open-world model by 5.9 APm and 8.1

READ FULL TEXT

page 1

page 3

page 13

page 16

research
03/09/2023

Open-world Instance Segmentation: Top-down Learning with Bottom-up Supervision

Many top-down architectures for instance segmentation achieve significan...
research
12/03/2021

Learning to Detect Every Thing in an Open World

Many open-world applications require the detection of novel objects, yet...
research
05/22/2023

Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation

Zero-shot instance segmentation aims to detect and precisely segment obj...
research
04/10/2021

Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation

Current state-of-the-art object detection and segmentation methods work ...
research
03/18/2022

ContrastMask: Contrastive Learning to Segment Every Thing

Partially-supervised instance segmentation is a task which requests segm...
research
04/03/2023

RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding

Existing 3D scene understanding tasks have achieved high performance on ...
research
08/04/2022

Open-world Contrastive Learning

Recent advance in contrastive learning has shown remarkable performance....

Please sign up or login with your details

Forgot password? Click here to reset