Hierarchical Visual Primitive Experts for Compositional Zero-Shot Learning

08/08/2023
by Hanjae Kim, et al.

Compositional zero-shot learning (CZSL) aims to recognize unseen compositions using prior knowledge of known primitives (attributes and objects). Previous CZSL methods often struggle to capture the contextuality between attribute and object, to learn discriminative visual features, and to cope with the long-tailed distribution of real-world compositional data. We propose a simple and scalable framework called Composition Transformer (CoT) to address these issues. CoT employs object and attribute experts in distinct ways to generate representative embeddings, using the visual network hierarchically. The object expert extracts representative object embeddings from the final layer in a bottom-up manner, while the attribute expert produces attribute embeddings in a top-down manner with a proposed object-guided attention module that models contextuality explicitly. To remedy biased predictions caused by the imbalanced data distribution, we develop a simple minority attribute augmentation (MAA) that synthesizes virtual samples by mixing two images and oversampling minority attribute classes. Our method achieves state-of-the-art (SoTA) performance on several benchmarks, including MIT-States, C-GQA, and VAW-CZSL. We also demonstrate the effectiveness of CoT in improving visual discrimination and addressing the model bias from the imbalanced data distribution. The code is available at https://github.com/HanjaeKim98/CoT.
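The abstract describes MAA only at a high level (mixing two images and oversampling minority attribute classes), so the following is a minimal sketch of that general idea and not the authors' implementation. Everything specific here is an assumption made for illustration: the inverse-frequency sampling weights, the Beta-distributed mixing ratio, the soft attribute target, and the hypothetical function names `minority_sampling_weights`, `mix_images`, and `virtual_sample`. Images are represented as flat lists of floats to keep the example dependency-free.

```python
import random
from collections import Counter


def minority_sampling_weights(attr_labels):
    """Weight each sample inversely to its attribute's frequency.

    Assumption: inverse-frequency weighting stands in for whatever
    oversampling rule the paper actually uses.
    """
    counts = Counter(attr_labels)
    raw = [1.0 / counts[a] for a in attr_labels]
    total = sum(raw)
    return [w / total for w in raw]


def mix_images(img_a, img_b, lam):
    """Pixel-wise convex blend of two equally sized images (flat float lists)."""
    if len(img_a) != len(img_b):
        raise ValueError("images must have the same size")
    return [lam * a + (1.0 - lam) * b for a, b in zip(img_a, img_b)]


def virtual_sample(images, attr_labels, rng, alpha=1.0):
    """Draw two samples with minority-biased weights and blend them.

    Assumption: the mixing ratio is drawn from Beta(alpha, alpha), as in
    mixup-style augmentation; the paper may choose it differently.
    """
    weights = minority_sampling_weights(attr_labels)
    i, j = rng.choices(range(len(images)), weights=weights, k=2)
    lam = rng.betavariate(alpha, alpha)
    mixed = mix_images(images[i], images[j], lam)
    # Soft attribute target: each source attribute keeps its mixing share.
    target = {}
    target[attr_labels[i]] = target.get(attr_labels[i], 0.0) + lam
    target[attr_labels[j]] = target.get(attr_labels[j], 0.0) + (1.0 - lam)
    return mixed, target
```

With attribute labels `["red", "red", "red", "wet"]`, the single "wet" sample receives half of the total sampling mass under this weighting, so rare attributes appear in virtual samples far more often than their raw frequency would suggest.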


