Stacked Spatio-Temporal Graph Convolutional Networks for Action Segmentation

11/26/2018
by Pallabi Ghosh, et al.

We propose novel Stacked Spatio-Temporal Graph Convolutional Networks (Stacked-STGCN) for action segmentation, i.e., predicting and localizing a sequence of actions over long videos. We extend the Spatio-Temporal Graph Convolutional Network (STGCN), originally proposed for skeleton-based action recognition, to support nodes with different characteristics (e.g., scene, actor, object, action), feature descriptors of varied lengths, and arbitrary temporal edge connections, which together account for the large graph deformation commonly associated with complex activities. We further introduce the stacked hourglass architecture to STGCN to leverage the advantages of an encoder-decoder design for improved generalization performance and localization accuracy. We explore frame-level VGG, segment-level I3D, and RCNN-based object features, among others, as node descriptors to enable action segmentation based on joint inference over comprehensive contextual information. We show results on CAD120 (which provides pre-computed node features and edge weights for fair performance comparison across algorithms) as well as on a more complex real-world activity dataset, Charades. Our Stacked-STGCN in general achieves 4.1 performance improvement over the best reported results in F1 score on CAD120 and 1.3
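To make the graph formulation above more concrete, the following is a minimal sketch of a single graph-convolution step over a tiny spatio-temporal graph whose nodes carry descriptors of different lengths. It is not the authors' Stacked-STGCN: it assumes a generic normalized propagation rule, ReLU(D^{-1/2}(A + I)D^{-1/2} H W), omits the stacked hourglass (encoder-decoder) structure entirely, and every function name, node type, weight, and feature value is hypothetical and chosen only for illustration.

```python
import math

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def normalize_adjacency(adj):
    """Return D^{-1/2} (A + I) D^{-1/2}; assumes non-negative edge weights."""
    n = len(adj)
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)] for i in range(n)]

def project_nodes(features, node_types, projections):
    """Map descriptors of different lengths to one shared width, per node type."""
    return [matmul([f], projections[t])[0] for f, t in zip(features, node_types)]

def graph_conv(adj, hidden, weight):
    """One propagation step: ReLU(A_norm @ H @ W)."""
    out = matmul(matmul(normalize_adjacency(adj), hidden), weight)
    return [[max(0.0, v) for v in row] for row in out]

# Toy graph (hypothetical): an actor node and an object node at two time steps.
# Node order: actor@t0, object@t0, actor@t1, object@t1.
node_types = ["actor", "object", "actor", "object"]
features = [
    [0.2, 0.1, 0.4],  # actor descriptor, length 3
    [0.5, 0.3],       # object descriptor, length 2
    [0.1, 0.6, 0.2],
    [0.4, 0.4],
]
# Per-type linear maps into a shared 2-dimensional space (made-up weights).
projections = {
    "actor": [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]],
    "object": [[1.0, 0.0], [0.0, 1.0]],
}
# Spatial edges link actor and object within a time step;
# temporal edges link the same entity across time steps.
adj = [
    [0.0, 1.0, 1.0, 0.0],
    [1.0, 0.0, 0.0, 1.0],
    [1.0, 0.0, 0.0, 1.0],
    [0.0, 1.0, 1.0, 0.0],
]
weight = [[1.0, -1.0], [0.5, 1.0]]  # layer weights (made-up)

hidden = project_nodes(features, node_types, projections)
out = graph_conv(adj, hidden, weight)
print(out)
```

The per-type projection is one simple way to handle descriptors of varied lengths; the temporal edges in `adj` are plain matrix entries, so arbitrary connections across time steps would only change which entries are non-zero.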
