Revisiting Transformer for Point Cloud-based 3D Scene Graph Generation

03/20/2023
by   Changsheng Lv, et al.

In this paper, we propose the semantic graph Transformer (SGT) for 3D scene graph generation. The task aims to parse a point cloud-based scene into a semantic structural graph, with the core challenge of modeling the complex global structure. Existing methods based on graph convolutional networks (GCNs) suffer from over-smoothing and can only propagate information from a limited set of neighboring nodes. In contrast, SGT uses Transformer layers as its base building block to allow global information passing, with two types of Transformer layers tailored to the 3D scene graph generation task. Specifically, we introduce the graph embedding layer to make the best use of the global information in graph edges while keeping computation costs comparable. Additionally, we propose the semantic injection layer to leverage categorical text labels and visual object knowledge. We evaluate SGT on the established 3DSSG benchmark and achieve a 35.9% absolute improvement in relationship prediction's R@50 and an 80.4% boost on the subset with complex scenes over the state-of-the-art. Our analyses further show SGT's superiority in long-tailed and zero-shot scenarios. We will release the code and model.
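The contrast between neighbor-limited propagation and global information passing can be illustrated with a small sketch. This is not the SGT architecture: it reproduces neither the graph embedding layer nor the semantic injection layer, and it uses no learned projections. The toy path graph, the feature values, and the function names `gcn_step` and `attention_weights` are all assumptions made for illustration.

```python
import math

def gcn_step(features, adjacency):
    """One mean-aggregation step: each node averages itself and its direct neighbors."""
    n = len(features)
    dim = len(features[0])
    out = []
    for i in range(n):
        nbrs = [j for j in range(n) if adjacency[i][j] or j == i]
        out.append([sum(features[j][d] for j in nbrs) / len(nbrs) for d in range(dim)])
    return out

def attention_weights(features):
    """Scaled dot-product attention weights, using raw features as queries and keys
    (an assumption: a real Transformer layer applies learned projections first)."""
    n = len(features)
    dim = len(features[0])
    weights = []
    for i in range(n):
        scores = [
            sum(a * b for a, b in zip(features[i], features[j])) / math.sqrt(dim)
            for j in range(n)
        ]
        top = max(scores)  # subtract the max for a numerically stable softmax
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        weights.append([e / total for e in exps])
    return weights

# Hypothetical path graph 0 - 1 - 2 - 3 with 2-D node features.
adjacency = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, -0.5]]

local = gcn_step(features, adjacency)   # node 0 only sees nodes 0 and 1
weights = attention_weights(features)   # node 0 attends to every node

print("GCN update for node 0:", local[0])
print("Attention weights for node 0:", [round(w, 3) for w in weights[0]])
```

In this toy setting, a single aggregation step leaves node 0 unaffected by nodes 2 and 3, whereas the attention weights for node 0 are nonzero for every node in the graph, which is the kind of global mixing the abstract attributes to Transformer layers.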


research
09/11/2021

BGT-Net: Bidirectional GRU Transformer Network for Scene Graph Generation

Scene graphs are nodes and edges consisting of objects and object-object...
research
11/21/2021

CpT: Convolutional Point Transformer for 3D Point Cloud Processing

We present CpT: Convolutional point Transformer - a novel deep learning ...
research
05/24/2023

GTNet: Graph Transformer Network for 3D Point Cloud Classification and Semantic Segmentation

Recently, graph-based and Transformer-based deep learning networks have ...
research
02/22/2022

One-shot Scene Graph Generation

As a structured representation of the image content, the visual scene gr...
research
03/25/2022

Gransformer: Transformer-based Graph Generation

Transformers have become widely used in modern models for various tasks ...
research
03/14/2023

Graph Transformer GANs for Graph-Constrained House Generation

We present a novel graph Transformer generative adversarial network (GTG...
research
03/25/2023

VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud

The task of 3D semantic scene graph (3DSSG) prediction in the point clou...
