Action Recognition for Depth Video using Multi-view Dynamic Images

06/29/2018
by   Yang Xiao, et al.
6

Dynamic image is the recently emerged action representation paradigm able to compactly capture the temporal evolution, especially in context of deep Convolutional Neural Network(CNN). Inspired by its preliminary success towards RGB videos, we propose its extension to the depth domain. To better exploit the 3D characteristics of depth video to leverage the performance, multi-view dynamic image is proposed by us. In particular, the raw depth video will be densely projected onto the different imaging view-points by rotating the virtual camera around the specific instances within the 3D space. Dynamic images are then extracted from the yielded multi-view depth videos respectively to constitute the multi-view dynamic images. In this way, more view-tolerant representative information can be involved in multiview dynamic images than the single-view counterpart. A novel CNN learning model is consequently proposed to execute feature learning on multi-view dynamic images. The dynamic images from different views will share the same convolutional layers, but with the different fully-connected layers. This model aims to enhance the tuning of shallow convolutional layers by alleviating gradient vanishing. Furthermore, to address the effect of spatial variation an action proposal method based on faster R-CNN is proposed. The dynamic images will be extracted only from the action proposal regions. In experiments, our approach can achieve the state-of-the-art performance on 3 challenging datasets (i.e., NTU RGB-D, Northwestern-UCLA and UWA3DII).

READ FULL TEXT

page 2

page 3

page 4

page 5

page 7

page 8

page 9

page 10

research
04/12/2019

Multi-View Region Adaptive Multi-temporal DMM and RGB Action Recognition

Human action recognition remains an important yet challenging task. This...
research
04/24/2019

A Large-scale Varying-view RGB-D Action Dataset for Arbitrary-view Human Action Recognition

Current researches of action recognition mainly focus on single-view and...
research
06/15/2016

DeepProposals: Hunting Objects and Actions by Cascading Deep Convolutional Layers

In this paper, a new method for generating object and action proposals i...
research
07/03/2020

ODE-CNN: Omnidirectional Depth Extension Networks

Omnidirectional 360 camera proliferates rapidly for autonomous robots si...
research
04/14/2023

NEV-NCD: Negative Learning, Entropy, and Variance regularization based novel action categories discovery

Novel Categories Discovery (NCD) facilitates learning from a partially a...
research
03/12/2019

Image Classification base on PCA of Multi-view Deep Representation

In the age of information explosion, image classification is the key tec...
research
08/25/2019

Texture and Structure Two-view Classification of Images

Textural and structural features can be regraded as "two-view" feature s...

Please sign up or login with your details

Forgot password? Click here to reset