Robust Multimodal Fusion for Human Activity Recognition

03/08/2023
by   Sanju Xaviar, et al.
0

The proliferation of IoT and mobile devices equipped with heterogeneous sensors has enabled new applications that rely on the fusion of time-series data generated by multiple sensors with different modalities. While there are promising deep neural network architectures for multimodal fusion, their performance falls apart quickly in the presence of consecutive missing data and noise across multiple modalities/sensors, the issues that are prevalent in real-world settings. We propose Centaur, a multimodal fusion model for human activity recognition (HAR) that is robust to these data quality issues. Centaur combines a data cleaning module, which is a denoising autoencoder with convolutional layers, and a multimodal fusion module, which is a deep convolutional neural network with the self-attention mechanism to capture cross-sensor correlation. We train Centaur using a stochastic data corruption scheme and evaluate it on three datasets that contain data generated by multiple inertial measurement units. Centaur's data cleaning module outperforms 2 state-of-the-art autoencoder-based models and its multimodal fusion module outperforms 4 strong baselines. Compared to 2 related robust fusion architectures, Centaur is more robust, achieving 11.59-17.52 in HAR, especially in the presence of consecutive missing data in multiple sensor channels.

READ FULL TEXT

page 5

page 11

research
04/29/2020

EmbraceNet for Activity: A Deep Multimodal Fusion Architecture for Activity Recognition

Human activity recognition using multiple sensors is a challenging but p...
research
08/03/2020

HAMLET: A Hierarchical Multimodal Attention-based Human Activity Recognition Algorithm

To fluently collaborate with people, robots need the ability to recogniz...
research
12/30/2019

SelectFusion: A Generic Framework to Selectively Learn Multisensory Fusion

Autonomous vehicles and mobile robotic systems are typically equipped wi...
research
11/01/2018

PerceptionNet: A Deep Convolutional Neural Network for Late Sensor Fusion

Human Activity Recognition (HAR) based on motion sensors has drawn a lot...
research
02/04/2017

Probabilistic Sensor Fusion for Ambient Assisted Living

There is a widely-accepted need to revise current forms of health-care p...
research
10/19/2022

MMRNet: Improving Reliability for Multimodal Computer Vision for Bin Picking via Multimodal Redundancy

Recently, there has been tremendous interest in industry 4.0 infrastruct...

Please sign up or login with your details

Forgot password? Click here to reset