CAT: LoCalization and IdentificAtion Cascade Detection Transformer for Open-World Object Detection

01/05/2023
by Shuailei Ma, et al.

Open-world object detection (OWOD), a more general and challenging goal, requires a model trained on data of known objects to detect both known and unknown objects and to incrementally learn to identify the unknown ones. Existing works, which employ a standard detection framework and a fixed pseudo-labelling mechanism (PLM), have the following problems: (i) Including the detection of unknown objects substantially reduces the model's ability to detect known ones. (ii) The PLM does not adequately utilize the prior knowledge of the inputs. (iii) The fixed selection manner of the PLM cannot guarantee that the model is trained in the right direction. We observe that humans subconsciously prefer to focus on all foreground objects first and then identify each one in detail, rather than localize and identify a single object simultaneously, which alleviates confusion. This motivates us to propose a novel solution called CAT: LoCalization and IdentificAtion Cascade Detection Transformer, which decouples the detection process via a shared decoder in a cascade decoding manner. Meanwhile, we propose a self-adaptive pseudo-labelling mechanism that combines model-driven and input-driven PLM and self-adaptively generates robust pseudo-labels for unknown objects, significantly improving the ability of CAT to retrieve unknown objects. Comprehensive experiments on two benchmark datasets, i.e., MS-COCO and PASCAL VOC, show that our model outperforms the state-of-the-art in terms of all metrics in the tasks of OWOD, incremental object detection (IOD) and open-set detection.
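
To make the idea of combining model-driven and input-driven pseudo-labelling more concrete, here is a minimal Python sketch. It is not the authors' implementation: the abstract does not say how the two signals are scored or how their weights are adapted, so everything below is an assumption for illustration. The function names (`iou`, `select_pseudo_labels`), the use of a per-box objectness score as the model-driven signal, the use of overlap with class-agnostic region proposals as the input-driven signal, and the fixed weights `w_model` and `w_input` are all hypothetical.

```python
from typing import List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def select_pseudo_labels(
    pred_boxes: Sequence[Box],
    objectness: Sequence[float],   # assumed model-driven signal
    proposals: Sequence[Box],      # assumed input-driven signal
    known_boxes: Sequence[Box],    # ground truth of known classes
    w_model: float = 0.5,          # hypothetical weight, fixed here
    w_input: float = 0.5,          # hypothetical weight, fixed here
    top_k: int = 1,
    known_iou: float = 0.5,
) -> List[int]:
    """Return indices of predicted boxes chosen as 'unknown' pseudo-labels."""
    scored = []
    for i, (box, obj) in enumerate(zip(pred_boxes, objectness)):
        # Skip predictions that already match a known-class annotation.
        if any(iou(box, k) > known_iou for k in known_boxes):
            continue
        # Input-driven score: best overlap with any region proposal.
        input_score = max((iou(box, p) for p in proposals), default=0.0)
        scored.append((w_model * obj + w_input * input_score, i))
    # Highest combined score first; keep the top_k candidates.
    scored.sort(key=lambda t: t[0], reverse=True)
    return [i for _, i in scored[:top_k]]


preds = [(0, 0, 10, 10), (20, 20, 30, 30), (50, 50, 60, 60)]
scores = [0.9, 0.5, 0.6]
chosen = select_pseudo_labels(
    preds, scores, proposals=[(20, 20, 30, 30)], known_boxes=[(0, 0, 10, 10)]
)
print(chosen)
```

In this toy example the first prediction is discarded because it coincides with a known object, and the second is preferred over the third because a region proposal supports it even though its objectness is lower. A self-adaptive variant would update `w_model` and `w_input` during training rather than fixing them; how CAT does this is not described in the abstract.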

research
12/02/2021

OW-DETR: Open-world Detection Transformer

Open-world object detection (OWOD) is a challenging computer vision prob...
research
03/21/2023

Detecting the open-world objects with the help of the Brain

Open World Object Detection (OWOD) is a novel computer vision task with ...
research
12/06/2022

Open World DETR: Transformer based Open World Object Detection

Open world object detection aims at detecting objects that are absent in...
research
03/03/2021

Towards Open World Object Detection

Humans have a natural instinct to identify unknown object instances in t...
research
06/04/2023

USD: Unknown Sensitive Detector Empowered by Decoupled Objectness and Segment Anything Model

Open World Object Detection (OWOD) is a novel and challenging computer v...
research
04/16/2019

Steganographer Identification

Conventional steganalysis detects the presence of steganography within s...
research
12/29/2015

A framework for robust object multi-detection with a vote aggregation and a cascade filtering

This paper presents a framework designed for the multi-object detection ...
