Query2Label: A Simple Transformer Way to Multi-Label Classification

07/22/2021
by Shilong Liu, et al.

This paper presents a simple and effective approach to multi-label classification. The approach uses Transformer decoders to query the existence of each class label. The use of Transformers is motivated by the need to extract local discriminative features adaptively for different labels, a strongly desired property because one image often contains multiple objects. The built-in cross-attention module in the Transformer decoder offers an effective way to use label embeddings as queries that probe and pool class-related features from a feature map computed by a vision backbone for subsequent binary classifications. Compared with prior work, the new framework is simple, using standard Transformers and vision backbones, and effective, consistently outperforming all previous works on five multi-label classification data sets, including MS-COCO, PASCAL VOC, NUS-WIDE, and Visual Genome. In particular, we establish 91.3% mAP on MS-COCO. We hope its compact structure, simple implementation, and superior performance serve as a strong baseline for multi-label classification tasks and future studies. The code will be available soon at https://github.com/SlongLiu/query2labels.
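
The query-and-pool idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a single attention head, omits the learned projections, self-attention, and feed-forward layers of a real Transformer decoder, and uses random numbers in place of backbone features and trained label embeddings. All names (`cross_attention_pool`, `label_probabilities`, and the toy sizes) are hypothetical.

```python
import math
import random


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention_pool(label_queries, features):
    """Pool one feature vector per label.

    label_queries: K vectors of size d (one embedding per class label).
    features:      N vectors of size d (flattened spatial feature map).
    Returns (pooled, attn): K pooled vectors and K attention rows over N positions.
    """
    d = len(features[0])
    scale = 1.0 / math.sqrt(d)  # scaled dot-product attention
    pooled, attn = [], []
    for q in label_queries:
        weights = softmax([dot(q, f) * scale for f in features])
        attn.append(weights)
        # Weighted average of the spatial features for this label.
        pooled.append(
            [sum(w * f[j] for w, f in zip(weights, features)) for j in range(d)]
        )
    return pooled, attn


def label_probabilities(pooled, class_weights, class_biases):
    """Independent binary classifier (sigmoid) for each label."""
    return [
        1.0 / (1.0 + math.exp(-(dot(p, w) + b)))
        for p, w, b in zip(pooled, class_weights, class_biases)
    ]


# Toy sizes (assumptions): 3 labels, 4 spatial positions, 5 feature channels.
random.seed(0)
K, N, d = 3, 4, 5
features = [[random.uniform(-1, 1) for _ in range(d)] for _ in range(N)]
label_queries = [[random.uniform(-1, 1) for _ in range(d)] for _ in range(K)]
class_weights = [[random.uniform(-1, 1) for _ in range(d)] for _ in range(K)]
class_biases = [0.0] * K

pooled, attn = cross_attention_pool(label_queries, features)
probs = label_probabilities(pooled, class_weights, class_biases)
print(probs)  # one independent probability per label
```

Each label attends to its own spatial positions, so two labels present in the same image can draw on different regions of the feature map before their separate binary decisions.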


research
06/11/2021

MlTr: Multi-label Classification with Transformer

The task of multi-label image classification is to recognize all the obj...
research
11/25/2021

ML-Decoder: Scalable and Versatile Classification Head

In this paper, we introduce ML-Decoder, a new attention-based classifica...
research
12/10/2021

Visual Transformers with Primal Object Queries for Multi-Label Image Classification

Multi-label image classification is about predicting a set of class labe...
research
08/21/2023

LDCSF: Local depth convolution-based Swim framework for classifying multi-label histopathology images

Histopathological images are the gold standard for diagnosing liver canc...
research
09/14/2022

Combining Metric Learning and Attention Heads For Accurate and Efficient Multilabel Image Classification

Multi-label image classification allows predicting a set of labels from ...
research
09/29/2021

Can multi-label classification networks know what they don't know?

Estimating out-of-distribution (OOD) uncertainty is a central challenge ...
research
05/01/2020

Investigating Class-level Difficulty Factors in Multi-label Classification Problems

This work investigates the use of class-level difficulty factors in mult...
