3D Human Activity Recognition with Reconfigurable Convolutional Neural Networks

01/26/2015
by Keze Wang, et al.

Human activity understanding with 3D/depth sensors has received increasing attention in multimedia processing and interaction. This work aims to develop a novel deep model for automatic activity recognition from RGB-D videos. We represent each human activity as an ensemble of cubic-like video segments, and learn to discover the temporal structures for a category of activities, i.e., how the activities are to be decomposed for classification. Our model can be regarded as a structured deep architecture, as it extends convolutional neural networks (CNNs) by incorporating structure alternatives. Specifically, we build a network consisting of 3D convolutions and max-pooling operators over the video segments, and introduce latent variables in each convolutional layer that manipulate the activation of neurons. Our model thus advances existing approaches in two aspects: (i) it acts directly on the raw inputs (grayscale-depth data) to conduct recognition instead of relying on hand-crafted features, and (ii) the model structure can be dynamically adjusted to account for the temporal variations of human activities, i.e., the network configuration is allowed to be partially activated during inference. For model training, we propose an EM-type optimization method that iteratively (i) discovers the latent structure by determining the decomposed actions for each training example, and (ii) learns the network parameters by using the back-propagation algorithm. Our approach is validated in challenging scenarios and outperforms state-of-the-art methods. In addition, we present a large human activity database of RGB-D videos.
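To make the two building blocks named in the abstract more concrete, the following is a minimal sketch, not the authors' implementation. It assumes a video is a nested list indexed as (time, height, width), uses a hand-set kernel in place of learned weights, and replaces the layered max-pooling with a single global max. The function names (`conv3d_valid`, `global_max_pool`, `best_segment_start`) are hypothetical. The last function illustrates the idea of a latent variable choosing where a segment begins: among candidate start frames, it keeps the one whose segment responds most strongly.

```python
from itertools import product


def conv3d_valid(video, kernel):
    """Naive 'valid' 3D cross-correlation over (time, height, width) nested lists."""
    T, H, W = len(video), len(video[0]), len(video[0][0])
    kt, kh, kw = len(kernel), len(kernel[0]), len(kernel[0][0])
    out = []
    for t in range(T - kt + 1):
        plane = []
        for y in range(H - kh + 1):
            row = []
            for x in range(W - kw + 1):
                s = 0.0
                # Sum the element-wise products inside the kt x kh x kw window.
                for dt, dy, dx in product(range(kt), range(kh), range(kw)):
                    s += video[t + dt][y + dy][x + dx] * kernel[dt][dy][dx]
                row.append(s)
            plane.append(row)
        out.append(plane)
    return out


def global_max_pool(volume):
    """Assumption: one global max stands in for the paper's max-pooling layers."""
    return max(v for plane in volume for row in plane for v in row)


def best_segment_start(video, kernel, seg_len, candidates):
    """Latent-structure step (illustrative): score each candidate start frame
    by the pooled response of its segment and return the best one."""
    scores = {
        s: global_max_pool(conv3d_valid(video[s:s + seg_len], kernel))
        for s in candidates
    }
    best = max(scores, key=scores.get)
    return best, scores


# Toy data (hypothetical): 6 frames of 3x3 pixels, with "motion" in frames 3 and 4.
video = [[[0.0] * 3 for _ in range(3)] for _ in range(6)]
for t in (3, 4):
    video[t] = [[1.0] * 3 for _ in range(3)]
kernel = [[[1.0, 1.0], [1.0, 1.0]] for _ in range(2)]  # 2x2x2 box filter

best, scores = best_segment_start(video, kernel, seg_len=3, candidates=[0, 1, 2])
print(best, scores)
```

In an EM-type scheme such as the one the abstract describes, a selection step of this kind would alternate with a parameter update by back-propagation; the update step is omitted here because the abstract does not specify it.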


research
12/05/2015

A Deep Structured Model with Radius-Margin Bound for 3D Human Activity Recognition

Understanding human activity is very challenging even with the recently ...
research
07/09/2018

Human Activity Recognition in RGB-D Videos by Dynamic Images

Human Activity Recognition in RGB-D videos has been an active research t...
research
07/12/2021

Human-like Relational Models for Activity Recognition in Video

Video activity recognition by deep neural networks is impressive for man...
research
08/04/2012

Human Activity Learning using Object Affordances from RGB-D Videos

Human activities comprise several sub-activities performed in a sequence...
research
03/06/2015

Latent Hierarchical Model for Activity Recognition

We present a novel hierarchical model for human activity recognition. In...
research
11/07/2017

Latent hypernet: Exploring all Layers from Convolutional Neural Networks

Since Convolutional Neural Networks (ConvNets) are able to simultaneousl...
research
12/05/2017

Learning to Forecast Videos of Human Activity with Multi-granularity Models and Adaptive Rendering

We propose an approach for forecasting video of complex human activity i...
