Squeeze-and-Excitation on Spatial and Temporal Deep Feature Space for Action Recognition

06/02/2018
by   Gaoyun An, et al.
0

Spatial and temporal features are two key and complementary information for human action recognition. In order to make full use of the intra-frame spatial characteristics and inter-frame temporal relationships, we propose the Squeeze-and-Excitation Long-term Recurrent Convolutional Networks (SE-LRCN) for human action recognition. The Squeeze and Excitation operations are used to implement the feature recalibration. In SE-LRCN, Squeeze-and-Excitation ResNet-34 (SE-ResNet-34) network is adopted to extract spatial features to enhance the dependencies and importance of feature channels of pixel granularity. We also propose the Squeeze-and-Excitation Long Short-Term Memory (SE-LSTM) network to model the temporal relationship, and to enhance the dependencies and importance of feature channels of frame granularity. We evaluate the proposed model on two challenging benchmarks, HMDB51 and UCF101, and the proposed SE-LRCN achieves the competitive results with the state-of-the-art.

READ FULL TEXT

page 6

page 8

research
11/18/2016

An End-to-End Spatio-Temporal Attention Model for Human Action Recognition from Skeleton Data

Human action recognition is an important task in computer vision. Extrac...
research
01/19/2023

Revisiting the Spatial and Temporal Modeling for Few-shot Action Recognition

Spatial and temporal modeling is one of the most core aspects of few-sho...
research
07/11/2021

Interpretable Deep Feature Propagation for Early Action Recognition

Early action recognition (action prediction) from limited preliminary ob...
research
04/23/2021

Modeling long-term interactions to enhance action recognition

In this paper, we propose a new approach to under-stand actions in egoce...
research
04/18/2019

DDLSTM: Dual-Domain LSTM for Cross-Dataset Action Recognition

Domain alignment in convolutional networks aims to learn the degree of l...
research
01/08/2018

Long-term Multi-granularity Deep Framework for Driver Drowsiness Detection

For real-world driver drowsiness detection from videos, the variation of...
research
12/06/2020

Representaciones del aprendizaje reutilizando los gradientes de la retropropagacion

This work proposes an algorithm for taking advantage of backpropagation ...

Please sign up or login with your details

Forgot password? Click here to reset