Self-Supervised Multimodal Fusion Transformer for Passive Activity Recognition

08/15/2022
by   Armand K. Koupai, et al.
0

The pervasiveness of Wi-Fi signals provides significant opportunities for human sensing and activity recognition in fields such as healthcare. The sensors most commonly used for passive Wi-Fi sensing are based on passive Wi-Fi radar (PWR) and channel state information (CSI) data, however current systems do not effectively exploit the information acquired through multiple sensors to recognise the different activities. In this paper, we explore new properties of the Transformer architecture for multimodal sensor fusion. We study different signal processing techniques to extract multiple image-based features from PWR and CSI data such as spectrograms, scalograms and Markov transition field (MTF). We first propose the Fusion Transformer, an attention-based model for multimodal and multi-sensor fusion. Experimental results show that our Fusion Transformer approach can achieve competitive results compared to a ResNet architecture but with much fewer resources. To further improve our model, we propose a simple and effective framework for multimodal and multi-sensor self-supervised learning (SSL). The self-supervised Fusion Transformer outperforms the baselines, achieving a F1-score of 95.9 this approach significantly outperforms the others when trained with as little as 1 training data.

READ FULL TEXT
research
04/29/2020

EmbraceNet for Activity: A Deep Multimodal Fusion Architecture for Activity Recognition

Human activity recognition using multiple sensors is a challenging but p...
research
09/23/2017

Self-supervised learning: When is fusion of the primary and secondary sensor cue useful?

Self-supervised learning (SSL) is a reliable learning mechanism in which...
research
04/19/2021

Self-Supervised WiFi-Based Activity Recognition

Traditional approaches to activity recognition involve the use of wearab...
research
02/24/2023

Streamlining Multimodal Data Fusion in Wireless Communication and Sensor Networks

This paper presents a novel approach for multimodal data fusion based on...
research
08/03/2020

HAMLET: A Hierarchical Multimodal Attention-based Human Activity Recognition Algorithm

To fluently collaborate with people, robots need the ability to recogniz...
research
04/12/2022

AutoFi: Towards Automatic WiFi Human Sensing via Geometric Self-Supervised Learning

WiFi sensing technology has shown superiority in smart homes among vario...
research
09/21/2023

Multimodal Transformers for Wireless Communications: A Case Study in Beam Prediction

Wireless communications at high-frequency bands with large antenna array...

Please sign up or login with your details

Forgot password? Click here to reset