Panoptic Scene Graph Generation

07/22/2022
by   Jingkang Yang, et al.
0

Existing research addresses scene graph generation (SGG) – a critical technology for scene understanding in images – from a detection perspective, i.e., objects are detected using bounding boxes followed by prediction of their pairwise relationships. We argue that such a paradigm causes several problems that impede the progress of the field. For instance, bounding box-based labels in current datasets usually contain redundant classes like hairs, and leave out background information that is crucial to the understanding of context. In this work, we introduce panoptic scene graph generation (PSG), a new problem task that requires the model to generate a more comprehensive scene graph representation based on panoptic segmentations rather than rigid bounding boxes. A high-quality PSG dataset, which contains 49k well-annotated overlapping images from COCO and Visual Genome, is created for the community to keep track of its progress. For benchmarking, we build four two-stage baselines, which are modified from classic methods in SGG, and two one-stage baselines called PSGTR and PSGFormer, which are based on the efficient Transformer-based detector, i.e., DETR. While PSGTR uses a set of queries to directly learn triplets, PSGFormer separately models the objects and relations in the form of queries from two Transformer decoders, followed by a prompting-like relation-object matching mechanism. In the end, we share insights on open challenges and future directions.

READ FULL TEXT

page 2

page 13

page 21

page 22

page 27

research
03/14/2018

Approximate Query Matching for Image Retrieval

Traditional image recognition involves identifying the key object in a p...
research
03/30/2021

Fully Convolutional Scene Graph Generation

This paper presents a fully convolutional scene graph generation (FCSGG)...
research
02/06/2023

1st Place Solution for PSG competition with ECCV'22 SenseHuman Workshop

Panoptic Scene Graph (PSG) generation aims to generate scene graph repre...
research
04/29/2021

Segmentation-grounded Scene Graph Generation

Scene graph generation has emerged as an important problem in computer v...
research
05/21/2023

PanoContext-Former: Panoramic Total Scene Understanding with a Transformer

Panoramic image enables deeper understanding and more holistic perceptio...
research
05/25/2019

Efficient Object Annotation via Speaking and Pointing

Deep neural networks deliver state-of-the-art visual recognition, but th...
research
09/06/2023

RepSGG: Novel Representations of Entities and Relationships for Scene Graph Generation

Scene Graph Generation (SGG) has achieved significant progress recently....

Please sign up or login with your details

Forgot password? Click here to reset