Infrared and 3D skeleton feature fusion for RGB-D action recognition

02/28/2020
by Alban Main de Boissiere, et al.

A challenge of skeleton-based action recognition is the difficulty of classifying actions with similar motions and object-related actions. Visual cues from other streams help in that regard. RGB data are sensitive to illumination conditions and are thus unusable in the dark. To alleviate this issue and still benefit from a visual stream, we propose a modular network (FUSION) combining skeleton and infrared data. A 2D convolutional neural network (CNN) is used as a pose module to extract features from skeleton data. A 3D CNN is used as an infrared module to extract visual cues from videos. Both feature vectors are then concatenated and exploited jointly using a multilayer perceptron (MLP). Skeleton data also condition the infrared videos, providing a crop around the performing subjects and thus virtually focusing the attention of the infrared module. Ablation studies show that using networks pre-trained on other large-scale datasets as our modules, together with data augmentation, yields considerable improvements in action classification accuracy. The strong contribution of our cropping strategy is also demonstrated. We evaluate our method on the NTU RGB+D dataset, the largest dataset for human action recognition from depth cameras, and report state-of-the-art performance.
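
The skeleton-conditioned crop described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact procedure, so the following is an assumption: the skeleton joints are taken as already projected to 2D pixel coordinates of the infrared frames, and a single bounding box enclosing all joints over the whole sequence is used to crop every frame. The function names (`skeleton_bounding_box`, `crop_video`) and the `margin` parameter are hypothetical and do not come from the paper.

```python
from typing import List, Sequence, Tuple

Joint2D = Tuple[float, float]  # (x, y) pixel coordinates in the infrared frame
Box = Tuple[int, int, int, int]  # (x_min, y_min, x_max, y_max), max values exclusive


def skeleton_bounding_box(
    frames_joints: Sequence[Sequence[Joint2D]],
    frame_width: int,
    frame_height: int,
    margin: int = 20,  # hypothetical padding in pixels, not a value from the paper
) -> Box:
    """Return one box enclosing every joint of every frame, clamped to the image."""
    xs = [x for joints in frames_joints for x, _ in joints]
    ys = [y for joints in frames_joints for _, y in joints]
    if not xs:
        # No skeleton available: fall back to the full frame (assumed behaviour).
        return 0, 0, frame_width, frame_height
    x_min = max(0, int(min(xs)) - margin)
    y_min = max(0, int(min(ys)) - margin)
    x_max = min(frame_width, int(max(xs)) + 1 + margin)
    y_max = min(frame_height, int(max(ys)) + 1 + margin)
    return x_min, y_min, x_max, y_max


def crop_video(video: List[List[List[int]]], box: Box) -> List[List[List[int]]]:
    """Crop every frame (a list of pixel rows) of the video to the same box."""
    x_min, y_min, x_max, y_max = box
    return [[row[x_min:x_max] for row in frame[y_min:y_max]] for frame in video]


# Usage: a toy 2-frame infrared clip of 8x6 pixels with two joints per frame.
video = [[[0] * 8 for _ in range(6)] for _ in range(2)]
joints = [[(2.0, 1.0), (4.5, 3.0)], [(3.0, 2.0), (5.0, 4.0)]]
box = skeleton_bounding_box(joints, frame_width=8, frame_height=6, margin=1)
cropped = crop_video(video, box)
```

Using one box for the whole sequence keeps the crop spatially stable across frames, which suits a 3D CNN that convolves over time; a per-frame crop would be an alternative design. In a full pipeline the cropped frames would typically be resized to the input resolution expected by the infrared module.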


