Adaptive Recursive Circle Framework for Fine-grained Action Recognition

07/25/2021
by   Hanxi Lin, et al.
0

How to model fine-grained spatial-temporal dynamics in videos has been a challenging problem for action recognition. It requires learning deep and rich features with superior distinctiveness for the subtle and abstract motions. Most existing methods generate features of a layer in a pure feedforward manner, where the information moves in one direction from inputs to outputs. And they rely on stacking more layers to obtain more powerful features, bringing extra non-negligible overheads. In this paper, we propose an Adaptive Recursive Circle (ARC) framework, a fine-grained decorator for pure feedforward layers. It inherits the operators and parameters of the original layer but is slightly different in the use of those operators and parameters. Specifically, the input of the layer is treated as an evolving state, and its update is alternated with the feature generation. At each recursive step, the input state is enriched by the previously generated features and the feature generation is made with the newly updated input state. We hope the ARC framework can facilitate fine-grained action recognition by introducing deeply refined features and multi-scale receptive fields at a low cost. Significant improvements over feedforward baselines are observed on several benchmarks. For example, an ARC-equipped TSM-ResNet18 outperforms TSM-ResNet50 with 48 FLOPs and 52

READ FULL TEXT

page 2

page 12

page 13

page 14

research
11/04/2017

Attentional Pooling for Action Recognition

We introduce a simple yet surprisingly powerful model to incorporate att...
research
06/30/2022

Spatial Transformer Network with Transfer Learning for Small-scale Fine-grained Skeleton-based Tai Chi Action Recognition

Human action recognition is a quite hugely investigated area where most ...
research
05/19/2020

Associating Multi-Scale Receptive Fields for Fine-grained Recognition

Extracting and fusing part features have become the key of fined-grained...
research
02/24/2022

Slow-Fast Visual Tempo Learning for Video-based Action Recognition

Action visual tempo characterizes the dynamics and the temporal scale of...
research
08/03/2022

Combined CNN Transformer Encoder for Enhanced Fine-grained Human Action Recognition

Fine-grained action recognition is a challenging task in computer vision...
research
08/20/2019

Action recognition with spatial-temporal discriminative filter banks

Action recognition has seen a dramatic performance improvement in the la...
research
12/13/2021

Multi-Expert Human Action Recognition with Hierarchical Super-Class Learning

In still image human action recognition, existing studies have mainly le...

Please sign up or login with your details

Forgot password? Click here to reset