Res3ATN – Deep 3D Residual Attention Network for Hand Gesture Recognition in Videos

01/04/2020
by   Naina Dhingra, et al.
13

Hand gesture recognition is a strenuous task to solve in videos. In this paper, we use a 3D residual attention network which is trained end to end for hand gesture recognition. Based on the stacked multiple attention blocks, we build a 3D network which generates different features at each attention block. Our 3D attention based residual network (Res3ATN) can be built and extended to very deep layers. Using this network, an extensive analysis is performed on other 3D networks based on three publicly available datasets. The Res3ATN network performance is compared to C3D, ResNet-10, and ResNext-101 networks. We also study and evaluate our baseline network with different number of attention blocks. The comparison shows that the 3D residual attention network with 3 attention blocks is robust in attention learning and is able to classify the gestures with better accuracy, thus outperforming existing networks.

READ FULL TEXT

page 4

page 5

research
02/07/2022

Deep Residual Shrinkage Networks for EMG-based Gesture Identification

This work introduces a method for high-accuracy EMG based gesture identi...
research
06/14/2018

HGR-Net: A Two-stage Convolutional Neural Network for Hand Gesture Segmentation and Recognition

Robust recognition of hand gesture in real-world applications is still a...
research
10/30/2018

DeepGRU: Deep Gesture Recognition Utility

We introduce DeepGRU, a deep learning based gesture and action recognize...
research
09/18/2020

Residual Spatial Attention Network for Retinal Vessel Segmentation

Reliable segmentation of retinal vessels can be employed as a way of mon...
research
08/04/2021

Multi-Branch with Attention Network for Hand-Based Person Recognition

In this paper, we propose a novel hand-based person recognition method f...
research
09/18/2020

An Enhanced Convolutional Neural Network in Side-Channel Attacks and Its Visualization

In recent years, the convolutional neural networks (CNNs) have received ...
research
07/20/2022

ViGAT: Bottom-up event recognition and explanation in video using factorized graph attention network

In this paper a pure-attention bottom-up approach, called ViGAT, that ut...

Please sign up or login with your details

Forgot password? Click here to reset