Sequential Deep Trajectory Descriptor for Action Recognition with Three-stream CNN

09/10/2016
by   Yemin Shi, et al.
0

Learning the spatial-temporal representation of motion information is crucial to human action recognition. Nevertheless, most of the existing features or descriptors cannot capture motion information effectively, especially for long-term motion. To address this problem, this paper proposes a long-term motion descriptor called sequential Deep Trajectory Descriptor (sDTD). Specifically, we project dense trajectories into two-dimensional planes, and subsequently a CNN-RNN network is employed to learn an effective representation for long-term motion. Unlike the popular two-stream ConvNets, the sDTD stream is introduced into a three-stream framework so as to identify actions from a video sequence. Consequently, this three-stream framework can simultaneously capture static spatial features, short-term motion and long-term motion in the video. Extensive experiments were conducted on three challenging datasets: KTH, HMDB51 and UCF101. Experimental results show that our method achieves state-of-the-art performance on the KTH and UCF101 datasets, and is comparable to the state-of-the-art methods on the HMDB51 dataset.

READ FULL TEXT

page 5

page 6

page 8

research
02/26/2019

IF-TTN: Information Fused Temporal Transformation Network for Video Action Recognition

Effective spatiotemporal feature representation is crucial to the video-...
research
11/16/2016

Joint Network based Attention for Action Recognition

By extracting spatial and temporal characteristics in one network, the t...
research
05/24/2019

Deep Trajectory for Recognition of Human Behaviours

Identifying human actions in complex scenes is widely considered as a ch...
research
02/13/2015

Long-short Term Motion Feature for Action Classification and Retrieval

We propose a method for representing motion information for video classi...
research
09/10/2016

A Tube-and-Droplet-based Approach for Representing and Analyzing Motion Trajectories

Trajectory analysis is essential in many applications. In this paper, we...
research
04/11/2021

Temporal Consistency Two-Stream CNN for Human Motion Prediction

Fusion is critical for a two-stream network. In this paper, we propose a...
research
02/19/2018

Learning Representative Temporal Features for Action Recognition

In this paper we present a novel video classification methodology that a...

Please sign up or login with your details

Forgot password? Click here to reset