Semantic-aligned Fusion Transformer for One-shot Object Detection

03/17/2022
by   Yizhou Zhao, et al.
0

One-shot object detection aims at detecting novel objects according to merely one given instance. With extreme data scarcity, current approaches explore various feature fusions to obtain directly transferable meta-knowledge. Yet, their performances are often unsatisfactory. In this paper, we attribute this to inappropriate correlation methods that misalign query-support semantics by overlooking spatial structures and scale variances. Upon analysis, we leverage the attention mechanism and propose a simple but effective architecture named Semantic-aligned Fusion Transformer (SaFT) to resolve these issues. Specifically, we equip SaFT with a vertical fusion module (VFM) for cross-scale semantic enhancement and a horizontal fusion module (HFM) for cross-sample feature fusion. Together, they broaden the vision for each feature point from the support to a whole augmented feature pyramid from the query, facilitating semantic-aligned associations. Extensive experiments on multiple benchmarks demonstrate the superiority of our framework. Without fine-tuning on novel classes, it brings significant performance gains to one-stage baselines, lifting state-of-the-art results to a higher level.

READ FULL TEXT

page 1

page 2

page 4

page 7

page 8

page 10

research
03/23/2021

Meta-DETR: Few-Shot Object Detection via Unified Image-Level Meta-Learning

Few-shot object detection aims at detecting novel objects with only a fe...
research
10/30/2021

Cross-Modality Fusion Transformer for Multispectral Object Detection

Multispectral image pairs can provide the combined information, making o...
research
10/26/2021

Plug-and-Play Few-shot Object Detection with Meta Strategy and Explicit Localization Inference

Aiming at recognizing and localizing the object of novel categories by a...
research
01/03/2023

Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation

Few Shot Instance Segmentation (FSIS) requires models to detect and segm...
research
12/01/2022

Concealed Object Detection for Passive Millimeter-Wave Security Imaging Based on Task-Aligned Detection Transformer

Passive millimeter-wave (PMMW) is a significant potential technique for ...
research
08/20/2021

DeFRCN: Decoupled Faster R-CNN for Few-Shot Object Detection

Few-shot object detection, which aims at detecting novel objects rapidly...
research
07/28/2022

Semantic-Aligned Matching for Enhanced DETR Convergence and Multi-Scale Feature Fusion

The recently proposed DEtection TRansformer (DETR) has established a ful...

Please sign up or login with your details

Forgot password? Click here to reset