Multi-Task Edge Prediction in Temporally-Dynamic Video Graphs

12/06/2022
by   Osman Ülger, et al.
4

Graph neural networks have shown to learn effective node representations, enabling node-, link-, and graph-level inference. Conventional graph networks assume static relations between nodes, while relations between entities in a video often evolve over time, with nodes entering and exiting dynamically. In such temporally-dynamic graphs, a core problem is inferring the future state of spatio-temporal edges, which can constitute multiple types of relations. To address this problem, we propose MTD-GNN, a graph network for predicting temporally-dynamic edges for multiple types of relations. We propose a factorized spatio-temporal graph attention layer to learn dynamic node representations and present a multi-task edge prediction loss that models multiple relations simultaneously. The proposed architecture operates on top of scene graphs that we obtain from videos through object detection and spatio-temporal linking. Experimental evaluations on ActionGenome and CLEVRER show that modeling multiple relations in our temporally-dynamic graph network can be mutually beneficial, outperforming existing static and spatio-temporal graph neural networks, as well as state-of-the-art predicate classification methods.

READ FULL TEXT

page 2

page 4

page 10

research
02/03/2021

Learning Graph Representations

Social and information networks are gaining huge popularity recently due...
research
12/07/2022

Dynamic Graph Node Classification via Time Augmentation

Node classification for graph-structured data aims to classify nodes who...
research
09/17/2020

Dynamic Regions Graph Neural Networks for Spatio-Temporal Reasoning

Graph Neural Networks are perfectly suited to capture latent interaction...
research
05/09/2020

Understanding Dynamic Scenes using Graph Convolution Networks

We present a novel Multi Relational Graph Convolutional Network (MRGCN) ...
research
06/25/2020

Unsupervised Video Decomposition using Spatio-temporal Iterative Inference

Unsupervised multi-object scene decomposition is a fast-emerging problem...
research
09/16/2021

STD-Trees: Spatio-temporal Deformable Trees for Multirotors Kinodynamic Planning

In constrained solution spaces with a huge number of homotopy classes, s...
research
04/21/2019

Storing and Querying Large-Scale Spatio-Temporal Graphs with High-Throughput Edge Insertions

Real-world graphs often contain spatio-temporal information and evolve o...

Please sign up or login with your details

Forgot password? Click here to reset