Fusion-GCN: Multimodal Action Recognition using Graph Convolutional Networks

09/27/2021
by   Michael Duhme, et al.
0

In this paper, we present Fusion-GCN, an approach for multimodal action recognition using Graph Convolutional Networks (GCNs). Action recognition methods based around GCNs recently yielded state-of-the-art performance for skeleton-based action recognition. With Fusion-GCN, we propose to integrate various sensor data modalities into a graph that is trained using a GCN model for multi-modal action recognition. Additional sensor measurements are incorporated into the graph representation, either on a channel dimension (introducing additional node attributes) or spatial dimension (introducing new nodes). Fusion-GCN was evaluated on two public available datasets, the UTD-MHAD- and MMACT datasets, and demonstrates flexible fusion of RGB sequences, inertial measurements and skeleton sequences. Our approach gets comparable results on the UTD-MHAD dataset and improves the baseline on the large-scale MMACT dataset by a significant margin of up to 12.37 with the fusion of skeleton estimates and accelerometer measurements.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/10/2022

Pose-Guided Graph Convolutional Networks for Skeleton-Based Action Recognition

Graph convolutional networks (GCNs), which can model the human body skel...
research
05/30/2023

High-Performance Inference Graph Convolutional Networks for Skeleton-Based Action Recognition

Recently, significant achievements have been made in skeleton-based huma...
research
03/13/2020

Gimme Signals: Discriminative signal encoding for multimodal activity recognition

We present a simple, yet effective and flexible method for action recogn...
research
06/30/2022

Skeleton-based Action Recognition via Adaptive Cross-Form Learning

Skeleton-based action recognition aims to project skeleton sequences to ...
research
11/13/2021

A Central Difference Graph Convolutional Operator for Skeleton-Based Action Recognition

This paper proposes a new graph convolutional operator called central di...
research
07/30/2020

Mix Dimension in Poincaré Geometry for 3D Skeleton-based Action Recognition

Graph Convolutional Networks (GCNs) have already demonstrated their powe...
research
11/04/2021

Skeleton-Split Framework using Spatial Temporal Graph Convolutional Networks for Action Recogntion

There has been a dramatic increase in the volume of videos and their rel...

Please sign up or login with your details

Forgot password? Click here to reset