Context-LSTM: a robust classifier for video detection on UCF101

03/13/2022
by Dengshan Li, et al.

Video detection and human action recognition can be computationally expensive, and the models take a long time to train. In this paper, we aim to reduce the training time and GPU memory usage of video detection while achieving competitive detection accuracy. Other works such as Two-stream, C3D, and TSN have shown excellent performance on UCF101. Here, we use a simple LSTM structure for video detection and achieve a competitive top-1 accuracy on the entire UCF101 validation dataset. The structure is named Context-LSTM because it can process deep temporal features, and it may simulate the human recognition system. We cascade LSTM blocks in PyTorch and connect their cell state flow and hidden output flow. At the connections between blocks, we apply ReLU, Batch Normalization, and MaxPooling. Context-LSTM can reduce training time and GPU memory usage while keeping a state-of-the-art top-1 accuracy on the entire UCF101 validation dataset, showing robust performance on video action detection.
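
The cascading described in the abstract can be illustrated with a minimal PyTorch sketch. This is not the authors' implementation: the class name `CascadedLSTM`, the layer sizes, pooling along the time axis, and classifying from the last time step are all assumptions made for illustration. The sketch only shows one plausible way to pass the hidden and cell states from one LSTM block to the next, with ReLU, Batch Normalization, and MaxPooling applied at the connection.

```python
import torch
import torch.nn as nn


class CascadedLSTM(nn.Module):
    """Hypothetical cascade of LSTM blocks (illustrative, not the paper's code)."""

    def __init__(self, input_size, hidden_size, num_classes, num_blocks=2):
        super().__init__()
        # The first block reads the input features; later blocks read hidden features.
        self.blocks = nn.ModuleList(
            [
                nn.LSTM(input_size if i == 0 else hidden_size, hidden_size, batch_first=True)
                for i in range(num_blocks)
            ]
        )
        # One BatchNorm layer per connection between consecutive blocks.
        self.norms = nn.ModuleList(
            [nn.BatchNorm1d(hidden_size) for _ in range(num_blocks - 1)]
        )
        self.relu = nn.ReLU()
        # Assumption: pooling halves the sequence length, not the feature size,
        # so the (h, c) state shapes stay compatible across blocks.
        self.pool = nn.MaxPool1d(kernel_size=2)
        self.fc = nn.Linear(hidden_size, num_classes)

    def forward(self, x):
        # x: (batch, time, features)
        state = None  # the first block starts from zero hidden and cell states
        for i, lstm in enumerate(self.blocks):
            # Passing `state` connects the hidden and cell flows between blocks.
            x, state = lstm(x, state)
            if i < len(self.blocks) - 1:
                x = x.transpose(1, 2)  # (batch, hidden, time) for BatchNorm1d/MaxPool1d
                x = self.pool(self.norms[i](self.relu(x)))
                x = x.transpose(1, 2)  # back to (batch, time // 2, hidden)
        # Assumption: classify from the output at the last time step.
        return self.fc(x[:, -1])


# Example: 4 clips, 16 frames each, 32 features per frame, 101 UCF101 classes.
model = CascadedLSTM(input_size=32, hidden_size=64, num_classes=101)
logits = model(torch.randn(4, 16, 32))
```

Pooling over time rather than over features is a design choice of this sketch: it keeps the hidden size fixed, which is what allows the state tuple to be handed directly to the next block.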

research
07/18/2017

Skeleton Based Human Action Recognition with Global Context-Aware Attention LSTM Networks

Human action recognition in 3D skeleton sequences has attracted a lot of...
research
06/05/2022

3D Convolutional with Attention for Action Recognition

Human action recognition is one of the challenging tasks in computer vis...
research
04/04/2017

Two Stream LSTM: A Deep Fusion Framework for Human Action Recognition

In this paper we address the problem of human action recognition from vi...
research
06/20/2022

Practical Deepfake Detection: Vulnerabilities in Global Contexts

Recent advances in deep learning have enabled realistic digital alterati...
research
02/19/2020

Human Action Recognition using Local Two-Stream Convolution Neural Network Features and Support Vector Machines

This paper proposes a simple yet effective method for human action recog...
research
07/18/2021

A stepped sampling method for video detection using LSTM

Artificial neural networks that simulate human achieves great successes....
research
11/19/2021

Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari

Across the Arcade Learning Environment, Rainbow achieves a level of perf...
