Temporal Reasoning Graph for Activity Recognition

08/27/2019
by   Jingran Zhang, et al.
38

Despite great success has been achieved in activity analysis, it still has many challenges. Most existing work in activity recognition pay more attention to design efficient architecture or video sampling strategy. However, due to the property of fine-grained action and long term structure in video, activity recognition is expected to reason temporal relation between video sequences. In this paper, we propose an efficient temporal reasoning graph (TRG) to simultaneously capture the appearance features and temporal relation between video sequences at multiple time scales. Specifically, we construct learnable temporal relation graphs to explore temporal relation on the multi-scale range. Additionally, to facilitate multi-scale temporal relation extraction, we design a multi-head temporal adjacent matrix to represent multi-kinds of temporal relations. Eventually, a multi-head temporal relation aggregator is proposed to extract the semantic meaning of those features convolving through the graphs. Extensive experiments are performed on widely-used large-scale datasets, such as Something-Something and Charades, and the results show that our model can achieve state-of-the-art performance. Further analysis shows that temporal relation reasoning with our TRG can extract discriminative features for activity recognition.

READ FULL TEXT

page 1

page 3

page 12

research
04/23/2019

Learning Actor Relation Graphs for Group Activity Recognition

Modeling relation between actors is important for recognizing group acti...
research
12/11/2021

COMPOSER: Compositional Learning of Group Activity in Videos

Group Activity Recognition (GAR) detects the activity performed by a gro...
research
08/08/2019

Progressive Relation Learning for Group Activity Recognition

Group activities usually involve spatio-temporal dynamics among many int...
research
08/22/2018

Deep Adaptive Temporal Pooling for Activity Recognition

Deep neural networks have recently achieved competitive accuracy for hum...
research
08/07/2018

Dynamic Temporal Pyramid Network: A Closer Look at Multi-Scale Modeling for Activity Detection

Recognizing instances at different scales simultaneously is a fundamenta...
research
03/18/2019

Human Activity Recognition for Edge Devices

Video activity Recognition has recently gained a lot of momentum with th...
research
09/22/2022

FuTH-Net: Fusing Temporal Relations and Holistic Features for Aerial Video Classification

Unmanned aerial vehicles (UAVs) are now widely applied to data acquisiti...

Please sign up or login with your details

Forgot password? Click here to reset