Gimme Signals: Discriminative signal encoding for multimodal activity recognition

03/13/2020
by Raphael Memmesheimer, et al.

We present a simple yet effective and flexible method for action recognition that supports multiple sensor modalities. Multivariate signal sequences are encoded as an image and then classified using the recently proposed EfficientNet CNN architecture. Our focus was to find an approach that generalizes well across different sensor modalities without modality-specific adaptations while still achieving good results. We apply our method to 4 action recognition datasets containing skeleton sequences, inertial and motion capturing measurements, and Wi-Fi fingerprints, with up to 120 action classes. Our method defines the current best CNN-based approach on the NTU RGB+D 120 dataset, lifts the state of the art on the ARIL Wi-Fi dataset by +6.78, and improves the UTD-MHAD inertial baseline by +14.4 [...] baseline by 1.13 (80/20 split). We further demonstrate experiments on both modality fusion at the signal level and signal reduction to prevent the representation from overloading.
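The encoding step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact image layout, so everything below is an assumption made for illustration: the function name `encode_signals`, the shared min-max normalization, the one-pixel-per-column rasterization, and the pseudo-random color palette are hypothetical and are not taken from the paper. The sketch only shows the general idea of drawing each channel of a multivariate sequence as a colored trace in an RGB image that an image classifier such as EfficientNet could then consume.

```python
import numpy as np


def encode_signals(signals, height=64, width=64):
    """Rasterize a (time_steps, channels) signal array into an RGB image.

    Illustrative sketch only: the layout, normalization and colors are
    assumptions, not the encoding used in the paper.
    """
    signals = np.asarray(signals, dtype=np.float64)
    if signals.ndim != 2:
        raise ValueError("expected an array of shape (time_steps, channels)")
    n_steps, n_channels = signals.shape

    # Resample every channel to `width` samples so that sequences of
    # different lengths map onto images of the same size.
    src_t = np.linspace(0.0, 1.0, n_steps)
    dst_t = np.linspace(0.0, 1.0, width)
    resampled = np.stack(
        [np.interp(dst_t, src_t, signals[:, c]) for c in range(n_channels)],
        axis=1,
    )

    # Normalize all channels jointly to [0, 1]; a constant input maps to 0.
    lo, hi = resampled.min(), resampled.max()
    span = hi - lo if hi > lo else 1.0
    norm = (resampled - lo) / span

    # Larger values are drawn closer to the top of the image.
    rows = np.round((1.0 - norm) * (height - 1)).astype(int)

    # Hypothetical palette: one fixed pseudo-random bright color per channel.
    rng = np.random.default_rng(0)
    colors = rng.integers(64, 256, size=(n_channels, 3)).astype(np.uint8)

    image = np.zeros((height, width, 3), dtype=np.uint8)
    cols = np.arange(width)
    for c in range(n_channels):
        image[rows[:, c], cols] = colors[c]
    return image


# Signal-level fusion, sketched as channel concatenation of two
# time-aligned modalities (synthetic placeholder data, not a real dataset).
t = np.linspace(0.0, 2.0 * np.pi, 100)
skeleton_like = np.stack([np.sin(t), np.cos(t)], axis=1)
inertial_like = np.stack([np.sin(2.0 * t)], axis=1)
fused_image = encode_signals(np.concatenate([skeleton_like, inertial_like], axis=1))
```

In this sketch, later channels overwrite earlier ones where traces cross, which hints at why a representation with many signals can become overloaded and why reducing the number of encoded signals may help.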

