Collaborative Attention Mechanism for Multi-View Action Recognition

09/14/2020
by Yue Bai, et al.

Multi-view action recognition (MVAR) leverages complementary temporal information from different views to enhance the learning process. Attention is an effective mechanism that has been extensively adopted for modeling temporal data. However, most existing MVAR methods use attention only to extract view-specific patterns, and ignore the potential to uncover latent mutual-support information in attention space. To take full advantage of multi-view cooperation, we propose a collaborative attention mechanism (CAM). It detects the attention differences among multi-view inputs and adaptively integrates complementary frame-level information so that the views benefit each other. Specifically, we build on a recurrent neural network (RNN), extending long short-term memory (LSTM) into a Mutual-Aid RNN (MAR). CAM takes advantage of the view-specific attention pattern of one view to guide another view and unlock information that is hard for that view to explore by itself. Extensive experiments on three action datasets show that CAM achieves better results for each single view and also boosts multi-view performance.
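The idea of letting one view's frame-level attention guide another can be illustrated with a small toy sketch. This is not the paper's CAM or MAR formulation, which the abstract does not spell out. The function name `collaborative_attention`, the `mix` parameter, and the simple linear blending rule are all assumptions made purely for illustration.

```python
import numpy as np


def softmax(x):
    """Numerically stable softmax over a 1-D array of frame scores."""
    e = np.exp(x - np.max(x))
    return e / e.sum()


def collaborative_attention(scores_a, scores_b, mix=0.5):
    """Toy illustration (hypothetical, not the authors' method).

    Each view has one raw attention score per frame. The attention
    difference between the two views is used to shift each view's
    frame weights toward frames the other view finds important.
    """
    att_a = softmax(np.asarray(scores_a, dtype=float))
    att_b = softmax(np.asarray(scores_b, dtype=float))
    diff = att_b - att_a           # positive where view B attends more than view A
    new_a = att_a + mix * diff     # view A borrows from view B
    new_b = att_b - mix * diff     # view B borrows from view A
    return new_a, new_b


def attend(features, attention):
    """Attention-weighted sum of per-frame features of shape (T, D)."""
    return attention @ features


# Example: 3 frames; view A focuses on frame 0, view B on frame 2.
att_a, att_b = collaborative_attention([2.0, 0.0, 0.0], [0.0, 0.0, 2.0], mix=0.25)
frames_a = np.eye(3)               # placeholder per-frame features for view A
pooled_a = attend(frames_a, att_a)
```

With `mix` between 0 and 1, each adjusted attention vector is a convex combination of the two original ones, so it remains a valid distribution over frames. A learned, adaptive weighting would replace the fixed `mix` in any realistic model.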


