Learning Attention as Disentangler for Compositional Zero-shot Learning

03/27/2023
by   Shaozhe Hao, et al.
0

Compositional zero-shot learning (CZSL) aims at learning visual concepts (i.e., attributes and objects) from seen compositions and combining concept knowledge into unseen compositions. The key to CZSL is learning the disentanglement of the attribute-object composition. To this end, we propose to exploit cross-attentions as compositional disentanglers to learn disentangled concept embeddings. For example, if we want to recognize an unseen composition "yellow flower", we can learn the attribute concept "yellow" and object concept "flower" from different yellow objects and different flowers respectively. To further constrain the disentanglers to learn the concept of interest, we employ a regularization at the attention level. Specifically, we adapt the earth mover's distance (EMD) as a feature similarity metric in the cross-attention module. Moreover, benefiting from concept disentanglement, we improve the inference process and tune the prediction score by combining multiple concept probabilities. Comprehensive experiments on three CZSL benchmark datasets demonstrate that our method significantly outperforms previous works in both closed- and open-world settings, establishing a new state-of-the-art.

READ FULL TEXT

page 8

page 13

page 14

research
06/01/2022

Learning Invariant Visual Representations for Compositional Zero-Shot Learning

Compositional Zero-Shot Learning (CZSL) aims to recognize novel composit...
research
07/12/2021

Zero-Shot Compositional Concept Learning

In this paper, we study the problem of recognizing compositional attribu...
research
10/07/2022

LOCL: Learning Object-Attribute Composition using Localization

This paper describes LOCL (Learning Object Attribute Composition using L...
research
03/01/2023

Distilled Reverse Attention Network for Open-world Compositional Zero-Shot Learning

Open-World Compositional Zero-Shot Learning (OW-CZSL) aims to recognize ...
research
10/09/2021

Learning Single/Multi-Attribute of Object with Symmetry and Group

Attributes and objects can compose diverse compositions. To model the co...
research
05/26/2023

CAILA: Concept-Aware Intra-Layer Adapters for Compositional Zero-Shot Learning

Compositionality, the ability to combine existing concepts and generaliz...
research
03/27/2023

Troika: Multi-Path Cross-Modal Traction for Compositional Zero-Shot Learning

Recent compositional zero-shot learning (CZSL) methods adapt pre-trained...

Please sign up or login with your details

Forgot password? Click here to reset