Action Recognition with Deep Multiple Aggregation Networks

06/08/2020
by   Ahmed Mazari, et al.
0

Most of the current action recognition algorithms are based on deep networks which stack multiple convolutional, pooling and fully connected layers. While convolutional and fully connected operations have been widely studied in the literature, the design of pooling operations that handle action recognition, with different sources of temporal granularity in action categories, has comparatively received less attention, and existing solutions rely mainly on max or averaging operations. The latter are clearly powerless to fully exhibit the actual temporal granularity of action categories and thereby constitute a bottleneck in classification performances. In this paper, we introduce a novel hierarchical pooling design that captures different levels of temporal granularity in action recognition. Our design principle is coarse-to-fine and achieved using a tree-structured network; as we traverse this network top-down, pooling operations are getting less invariant but timely more resolute and well localized. Learning the combination of operations in this network – which best fits a given ground-truth – is obtained by solving a constrained minimization problem whose solution corresponds to the distribution of weights that capture the contribution of each level (and thereby temporal granularity) in the global hierarchical pooling process. Besides being principled and well grounded, the proposed hierarchical pooling is also video-length and resolution agnostic. Extensive experiments conducted on the challenging UCF-101, HMDB-51 and JHMDB-21 databases corroborate all these statements.

READ FULL TEXT

page 3

page 14

research
06/08/2020

Deep hierarchical pooling design for cross-granularity action recognition

In this paper, we introduce a novel hierarchical aggregation design that...
research
05/02/2019

Human Action Recognition with Deep Temporal Pyramids

Deep convolutional neural networks (CNNs) are nowadays achieving signifi...
research
10/15/2019

Human Action Recognition with Multi-Laplacian Graph Convolutional Networks

Convolutional neural networks are nowadays witnessing a major success in...
research
05/29/2019

Hierarchical Feature Aggregation Networks for Video Action Recognition

Most action recognition methods base on a) a late aggregation of frame l...
research
08/20/2019

Action recognition with spatial-temporal discriminative filter banks

Action recognition has seen a dramatic performance improvement in the la...
research
03/27/2023

Unified Keypoint-based Action Recognition Framework via Structured Keypoint Pooling

This paper simultaneously addresses three limitations associated with co...

Please sign up or login with your details

Forgot password? Click here to reset