A Dual Modality Approach For (Zero-Shot) Multi-Label Classification

08/19/2022
by   Shichao Xu, et al.
33

In computer vision, multi-label classification, including zero-shot multi-label classification are important tasks with many real-world applications. In this paper, we propose a novel algorithm, Aligned Dual moDality ClaSsifier (ADDS), which includes a Dual-Modal decoder (DM-decoder) with alignment between visual and textual features, for multi-label classification tasks. Moreover, we design a simple and yet effective method called Pyramid-Forwarding to enhance the performance for inputs with high resolutions. Extensive experiments conducted on standard multi-label benchmark datasets, MS-COCO and NUS-WIDE, demonstrate that our approach significantly outperforms previous methods and provides state-of-the-art performance for conventional multi-label classification, zero-shot multi-label classification, and an extreme case called single-to-multi label classification where models trained on single-label datasets (ImageNet-1k, ImageNet-21k) are tested on multi-label ones (MS-COCO and NUS-WIDE). We also analyze how visual-textual alignment contributes to the proposed approach, validate the significance of the DM-decoder, and demonstrate the effectiveness of Pyramid-Forwarding on vision transformer.

READ FULL TEXT

page 3

page 4

page 11

page 12

research
01/27/2021

Generative Multi-Label Zero-Shot Learning

Multi-label zero-shot learning strives to classify images into multiple ...
research
06/20/2022

DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations

Solving multi-label recognition (MLR) for images in the low-label regime...
research
08/03/2023

DualCoOp++: Fast and Effective Adaptation to Multi-Label Recognition with Limited Annotations

Multi-label image recognition in the low-label regime is a task of great...
research
11/25/2021

ML-Decoder: Scalable and Versatile Classification Head

In this paper, we introduce ML-Decoder, a new attention-based classifica...
research
09/29/2021

Can multi-label classification networks know what they don't know?

Estimating out-of-distribution (OOD) uncertainty is a central challenge ...
research
04/04/2021

Improving Pretrained Models for Zero-shot Multi-label Text Classification through Reinforced Label Hierarchy Reasoning

Exploiting label hierarchies has become a promising approach to tackling...
research
05/28/2021

pRSL: Interpretable Multi-label Stacking by Learning Probabilistic Rules

A key task in multi-label classification is modeling the structure betwe...

Please sign up or login with your details

Forgot password? Click here to reset