Real-time Hand Gesture Detection and Classification Using Convolutional Neural Networks

01/29/2019
by Okan Köpüklü, et al.

Real-time recognition of dynamic hand gestures from video streams is a challenging task since (i) there is no indication of when a gesture starts and ends in the video, (ii) each performed gesture should be recognized only once, and (iii) the entire architecture should be designed with the memory and power budget in mind. In this work, we address these challenges by proposing a hierarchical structure that enables offline-working convolutional neural network (CNN) architectures to operate online efficiently using a sliding-window approach. The proposed architecture consists of two models: (1) a detector, which is a lightweight CNN architecture that detects gestures, and (2) a classifier, which is a deep CNN that classifies the detected gestures. To evaluate the single-time activations of the detected gestures, we propose to use the Levenshtein distance as an evaluation metric, since it can measure misclassifications, multiple detections, and missing detections at the same time. We evaluate our architecture on two publicly available datasets, the EgoGesture and NVIDIA Dynamic Hand Gesture datasets, which require temporal detection and classification of the performed hand gestures. The ResNeXt-101 model, which is used as a classifier, achieves the state-of-the-art offline classification accuracy of 94.04 and NVIDIA benchmarks, respectively. In real-time detection and classification, we obtain considerably early detections while achieving performance close to offline operation. The code and pretrained models used in this work are publicly available.
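As an illustration of why the Levenshtein distance suits this evaluation, the sketch below compares a predicted sequence of gesture labels against the ground-truth sequence for one video. A substitution corresponds to a misclassification, an insertion to a multiple detection, and a deletion to a missing detection. The function names and the normalization by ground-truth length are assumptions made for this example, not details taken from the abstract.

```python
from typing import Sequence


def levenshtein(pred: Sequence[int], truth: Sequence[int]) -> int:
    """Edit distance between predicted and ground-truth label sequences.

    Uses the standard dynamic-programming recurrence, keeping only the
    previous row of the table.
    """
    # prev[j] = distance between an empty prediction and truth[:j]
    prev = list(range(len(truth) + 1))
    for i, p in enumerate(pred, start=1):
        curr = [i]  # distance between pred[:i] and an empty truth
        for j, t in enumerate(truth, start=1):
            cost = 0 if p == t else 1
            curr.append(min(
                prev[j] + 1,         # drop an extra prediction (multiple detection)
                curr[j - 1] + 1,     # add a missed gesture (missing detection)
                prev[j - 1] + cost,  # match, or relabel (misclassification)
            ))
        prev = curr
    return prev[-1]


def levenshtein_accuracy(pred: Sequence[int], truth: Sequence[int]) -> float:
    """Hypothetical normalization: 1 - distance / number of true gestures."""
    if not truth:
        raise ValueError("ground-truth sequence must not be empty")
    return 1.0 - levenshtein(pred, truth) / len(truth)


if __name__ == "__main__":
    truth = [1, 2, 3, 4, 5, 6, 7, 8, 9]
    # One misclassification (3 -> 7), one duplicate (6), one miss (8).
    pred = [1, 2, 7, 4, 5, 6, 6, 7, 9]
    print(levenshtein(pred, truth))
    print(levenshtein_accuracy(pred, truth))
```

Because all three error types contribute to a single number, one score per video summarizes how well the single-time activation behaves, without separate counters for each kind of mistake.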


