Knowledge-augmented Few-shot Visual Relation Detection

03/09/2023
by   Tianyu Yu, et al.
5

Visual Relation Detection (VRD) aims to detect relationships between objects for image understanding. Most existing VRD methods rely on thousands of training samples of each relationship to achieve satisfactory performance. Some recent papers tackle this problem by few-shot learning with elaborately designed pipelines and pre-trained word vectors. However, the performance of existing few-shot VRD models is severely hampered by the poor generalization capability, as they struggle to handle the vast semantic diversity of visual relationships. Nonetheless, humans have the ability to learn new relationships with just few examples based on their knowledge. Inspired by this, we devise a knowledge-augmented, few-shot VRD framework leveraging both textual knowledge and visual relation knowledge to improve the generalization ability of few-shot VRD. The textual knowledge and visual relation knowledge are acquired from a pre-trained language model and an automatically constructed visual relation knowledge graph, respectively. We extensively validate the effectiveness of our framework. Experiments conducted on three benchmarks from the commonly used Visual Genome dataset show that our performance surpasses existing state-of-the-art models with a large improvement.

READ FULL TEXT

page 1

page 8

research
10/20/2022

Visual-Semantic Contrastive Alignment for Few-Shot Image Classification

Few-Shot learning aims to train and optimize a model that can adapt to u...
research
10/01/2019

Compensating Supervision Incompleteness with Prior Knowledge in Semantic Image Interpretation

Semantic Image Interpretation is the task of extracting a structured sem...
research
02/22/2022

One-shot Scene Graph Generation

As a structured representation of the image content, the visual scene gr...
research
07/25/2023

GraspGPT: Leveraging Semantic Knowledge from a Large Language Model for Task-Oriented Grasping

Task-oriented grasping (TOG) refers to the problem of predicting grasps ...
research
04/21/2023

RPLKG: Robust Prompt Learning with Knowledge Graph

Large-scale pre-trained models have been known that they are transferabl...
research
03/12/2021

Inductive Relation Prediction by BERT

Relation prediction in knowledge graphs is dominated by embedding based ...
research
04/25/2019

Scene Graph Prediction with Limited Labels

Visual knowledge bases such as Visual Genome power numerous applications...

Please sign up or login with your details

Forgot password? Click here to reset