Vision Transformer with Convolutional Encoder-Decoder for Hand Gesture Recognition using 24 GHz Doppler Radar

09/12/2022
by Kavinda Kehelella, et al.

Transformers combined with convolutional encoders have recently been used for hand gesture recognition (HGR) using micro-Doppler signatures. We propose a vision-transformer-based architecture for HGR with multi-antenna continuous-wave Doppler radar receivers. The proposed architecture consists of three modules: a convolutional encoder-decoder, an attention module with three transformer layers, and a multi-layer perceptron. The novel convolutional decoder makes it possible to feed larger patches to the attention module for improved feature extraction. Experimental results obtained with a dataset corresponding to a two-antenna continuous-wave Doppler radar receiver operating at 24 GHz (published by Skaria et al.) confirm that the proposed architecture achieves an accuracy of 98.3%, which is similar to the state of the art on that dataset.
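The abstract gives no layer sizes, so the following is only a shape-bookkeeping sketch of why a decoder stage can allow larger patches. Everything numeric here is an assumption for illustration: the 64x64 input size, the two stride-2 encoder convolutions, the single stride-2 transposed convolution in the decoder, the channel counts, and the helper names (`Shape`, `encoder_layer`, `decoder_layer`, `patch_tokens`). Only the two input channels follow from the source, which mentions a two-antenna receiver.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Shape:
    channels: int
    height: int
    width: int


def conv_out(size: int, kernel: int, stride: int, padding: int) -> int:
    """Output length of a standard strided convolution along one axis."""
    return (size + 2 * padding - kernel) // stride + 1


def deconv_out(size: int, kernel: int, stride: int, padding: int) -> int:
    """Output length of a transposed convolution along one axis."""
    return (size - 1) * stride - 2 * padding + kernel


def encoder_layer(s: Shape, out_channels: int) -> Shape:
    # Assumed layer: 3x3 convolution, stride 2, padding 1 (halves each side).
    return Shape(out_channels, conv_out(s.height, 3, 2, 1), conv_out(s.width, 3, 2, 1))


def decoder_layer(s: Shape, out_channels: int) -> Shape:
    # Assumed layer: 2x2 transposed convolution, stride 2, no padding (doubles each side).
    return Shape(out_channels, deconv_out(s.height, 2, 2, 0), deconv_out(s.width, 2, 2, 0))


def patch_tokens(s: Shape, patch: int) -> tuple[int, int]:
    """Return (number of tokens, flattened token length) for square patches."""
    if s.height % patch or s.width % patch:
        raise ValueError("patch size must divide the feature-map size")
    count = (s.height // patch) * (s.width // patch)
    return count, s.channels * patch * patch


# Assumed input: one 64x64 micro-Doppler map per antenna, two antennas.
spectrogram = Shape(channels=2, height=64, width=64)
encoded = encoder_layer(encoder_layer(spectrogram, 16), 32)
decoded = decoder_layer(encoded, 16)

# Same 4x4 token grid in both cases, but the decoded map supports 8x8 patches
# where the encoder-only map supports 4x4 patches.
tokens_without_decoder = patch_tokens(encoded, patch=4)
tokens_with_decoder = patch_tokens(decoded, patch=8)
```

Under these assumed sizes, the attention module would see the same number of tokens either way, while each token from the decoded map covers a larger patch. The actual layer configuration is given in the full paper, not in this abstract.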

research
12/09/2021

PE-former: Pose Estimation Transformer

Vision transformer architectures have been demonstrated to work very eff...
research
04/30/2023

TransCAR: Transformer-based Camera-And-Radar Fusion for 3D Object Detection

Despite radar's popularity in the automotive industry, for fusion-based ...
research
09/18/2023

Gesture Recognition in Millimeter-Wave Radar Based on Spatio-Temporal Feature Sequences

Gesture recognition is a pivotal technology in the realm of intelligent ...
research
02/24/2023

A Convolutional Vision Transformer for Semantic Segmentation of Side-Scan Sonar Data

Distinguishing among different marine benthic habitat characteristics is...
research
06/18/2020

Multi-Encoder-Decoder Transformer for Code-Switching Speech Recognition

Code-switching (CS) occurs when a speaker alternates words of two or mor...
research
11/08/2022

Eat-Radar: Continuous Fine-Grained Eating Gesture Detection Using FMCW Radar and 3D Temporal Convolutional Network

Unhealthy dietary habits are considered as the primary cause of multiple...
research
11/04/2022

RCDPT: Radar-Camera fusion Dense Prediction Transformer

Recently, transformer networks have outperformed traditional deep neural...
