Zero-Shot Scene Graph Generation via Triplet Calibration and Reduction

09/07/2023
by   Jiankai Li, et al.
0

Scene Graph Generation (SGG) plays a pivotal role in downstream vision-language tasks. Existing SGG methods typically suffer from poor compositional generalizations on unseen triplets. They are generally trained on incompletely annotated scene graphs that contain dominant triplets and tend to bias toward these seen triplets during inference. To address this issue, we propose a Triplet Calibration and Reduction (T-CAR) framework in this paper. In our framework, a triplet calibration loss is first presented to regularize the representations of diverse triplets and to simultaneously excavate the unseen triplets in incompletely annotated training scene graphs. Moreover, the unseen space of scene graphs is usually several times larger than the seen space since it contains a huge number of unrealistic compositions. Thus, we propose an unseen space reduction loss to shift the attention of excavation to reasonable unseen compositions to facilitate the model training. Finally, we propose a contextual encoder to improve the compositional generalizations of unseen triplets by explicitly modeling the relative spatial relations between subjects and objects. Extensive experiments show that our approach achieves consistent improvements for zero-shot SGG over state-of-the-art methods. The code is available at https://github.com/jkli1998/T-CAR.

READ FULL TEXT

page 2

page 5

page 12

page 17

research
05/03/2021

Learning Graph Embeddings for Open World Compositional Zero-Shot Learning

Compositional Zero-Shot learning (CZSL) aims to recognize unseen composi...
research
01/09/2021

Entropy-Based Uncertainty Calibration for Generalized Zero-Shot Learning

Compared to conventional zero-shot learning (ZSL) where recognising unse...
research
08/22/2022

Reference-Limited Compositional Zero-Shot Learning

Compositional zero-shot learning (CZSL) refers to recognizing unseen com...
research
06/29/2022

Siamese Contrastive Embedding Network for Compositional Zero-Shot Learning

Compositional Zero-Shot Learning (CZSL) aims to recognize unseen composi...
research
08/30/2019

TGG: Transferable Graph Generation for Zero-shot and Few-shot Learning

Zero-shot and few-shot learning aim to improve generalization to unseen ...
research
07/07/2021

Mitigating Generation Shifts for Generalized Zero-Shot Learning

Generalized Zero-Shot Learning (GZSL) is the task of leveraging semantic...
research
03/03/2021

Energy-Based Learning for Scene Graph Generation

Traditional scene graph generation methods are trained using cross-entro...

Please sign up or login with your details

Forgot password? Click here to reset