Human Action Recognition using Local Two-Stream Convolution Neural Network Features and Support Vector Machines

02/19/2020
by   David Torpey, et al.
0

This paper proposes a simple yet effective method for human action recognition in video. The proposed method separately extracts local appearance and motion features using state-of-the-art three-dimensional convolutional neural networks from sampled snippets of a video. These local features are then concatenated to form global representations which are then used to train a linear SVM to perform the action classification using full context of the video, as partial context as used in previous works. The videos undergo two simple proposed preprocessing techniques, optical flow scaling and crop filling. We perform an extensive evaluation on three common benchmark dataset to empirically show the benefit of the SVM, and the two preprocessing steps.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/18/2019

Global Temporal Representation based CNNs for Infrared Action Recognition

Infrared human action recognition has many advantages, i.e., it is insen...
research
10/01/2013

Combining Spatio-Temporal Appearance Descriptors and Optical Flow for Human Action Recognition in Video Data

This paper proposes combining spatio-temporal appearance (STA) descripto...
research
07/25/2019

A Novel Approach for Robust Multi Human Action Detection and Recognition based on 3-Dimentional Convolutional Neural Networks

In recent years, various attempts have been proposed to explore the use ...
research
07/21/2017

Multi-kernel learning of deep convolutional features for action recognition

Image understanding using deep convolutional network has reached human-l...
research
06/07/2019

Video Modeling with Correlation Networks

Motion is a salient cue to recognize actions in video. Modern action rec...
research
01/25/2017

Deep Local Video Feature for Action Recognition

We investigate the problem of representing an entire video using CNN fea...
research
03/13/2022

Context-LSTM: a robust classifier for video detection on UCF101

Video detection and human action recognition may be computationally expe...

Please sign up or login with your details

Forgot password? Click here to reset