Prototype-based Embedding Network for Scene Graph Generation

03/13/2023
by Chaofan Zheng, et al.

Current Scene Graph Generation (SGG) methods explore contextual information to predict relationships among entity pairs. However, because the many possible subject-object combinations differ widely in visual appearance, there is large intra-class variation within each predicate category, e.g., "man-eating-pizza" vs. "giraffe-eating-leaf", and severe inter-class similarity between different categories, e.g., "man-holding-plate" vs. "man-eating-pizza", in the model's latent space. These challenges prevent current SGG methods from acquiring robust features for reliable relation prediction. In this paper, we claim that the predicate's category-inherent semantics can serve as class-wise prototypes in the semantic space to relieve these challenges. To this end, we propose the Prototype-based Embedding Network (PE-Net), which models entities and predicates with prototype-aligned, compact, and distinctive representations, and thereby establishes matching between entity pairs and predicates in a common embedding space for relation recognition. Moreover, Prototype-guided Learning (PL) is introduced to help PE-Net efficiently learn such entity-predicate matching, and Prototype Regularization (PR) is devised to relieve the ambiguous entity-predicate matching caused by semantic overlap between predicates. Extensive experiments demonstrate that our method gains superior relation recognition capability on SGG, achieving new state-of-the-art performance on both the Visual Genome and Open Images datasets.
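
The matching idea in the abstract can be illustrated with a toy sketch. This is not the PE-Net implementation: it only assumes that each predicate class has one prototype vector and that an entity pair is scored against every prototype by cosine similarity in a shared space. The function names (`cosine`, `match_predicate`) and the hand-made 3-d vectors are hypothetical and chosen purely for illustration; the actual model learns its representations and uses the PL and PR objectives, which are not reproduced here.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def match_predicate(pair_embedding, prototypes):
    """Score an entity-pair embedding against every predicate prototype.

    prototypes: dict mapping predicate label -> prototype vector.
    Returns the best-matching label and the full score dict.
    """
    scores = {
        label: cosine(pair_embedding, proto)
        for label, proto in prototypes.items()
    }
    best = max(scores, key=scores.get)
    return best, scores


# Hypothetical prototypes: "eating" and "holding" are deliberately close,
# mimicking the inter-class similarity the abstract describes.
prototypes = {
    "eating": [1.0, 0.1, 0.0],
    "holding": [0.8, 0.6, 0.0],
    "riding": [0.0, 0.0, 1.0],
}

# Hypothetical embedding for a subject-object pair such as (man, pizza).
pair = [0.9, 0.2, 0.1]

best, scores = match_predicate(pair, prototypes)
print(best)
for label, score in scores.items():
    print(f"{label}: {score:.3f}")
```

In this toy setup the "eating" and "holding" scores are close to each other, which is the kind of ambiguity that, according to the abstract, Prototype Regularization is meant to reduce by keeping prototypes distinctive.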

