Efficient sign language recognition system and dataset creation method based on deep learning and image processing

New deep-learning architectures are created every year, achieving state-of-the-art results in image recognition and leading to the belief that, in a few years, complex tasks such as sign language translation will be considerably easier, serving as a communication tool for the hearing-impaired community. On the other hand, these algorithms still need a lot of data to be trained and the dataset creation process is expensive, time-consuming, and slow. Thereby, this work aims to investigate techniques of digital image processing and machine learning that can be used to create a sign language dataset effectively. We argue about data acquisition, such as the frames per second rate to capture or subsample the videos, the background type, preprocessing, and data augmentation, using convolutional neural networks and object detection to create an image classifier and comparing the results based on statistical tests. Different datasets were created to test the hypotheses, containing 14 words used daily and recorded by different smartphones in the RGB color system. We achieved an accuracy of 96.38 the validation set containing more challenging conditions, showing that 30 FPS is the best frame rate subsample to train the classifier, geometric transformations work better than intensity transformations, and artificial background creation is not effective to model generalization. These trade-offs should be considered in future work as a cost-benefit guideline between computational cost and accuracy gain when creating a dataset and training a sign recognition model.

READ FULL TEXT

page 3

page 7

page 8

page 9

research
08/14/2022

BDSL 49: A Comprehensive Dataset of Bangla Sign Language

Language is a method by which individuals express their thoughts. Each l...
research
01/06/2023

Design of Arabic Sign Language Recognition Model

Deaf people are using sign language for communication, and it is a combi...
research
01/05/2022

Sign Language Recognition System using TensorFlow Object Detection API

Communication is defined as the act of sharing or exchanging information...
research
03/31/2023

Traffic Sign Recognition Dataset and Data Augmentation

Although there are many datasets for traffic sign classification, there ...
research
02/02/2017

Deep Learning the Indus Script

Standardized corpora of undeciphered scripts, a necessary starting point...
research
08/22/2023

CNN based Cuneiform Sign Detection Learned from Annotated 3D Renderings and Mapped Photographs with Illumination Augmentation

Motivated by the challenges of the Digital Ancient Near Eastern Studies ...
research
12/18/2020

Transfer Learning Based Automatic Model Creation Tool For Resource Constraint Devices

With the enhancement of Machine Learning, many tools are being designed ...

Please sign up or login with your details

Forgot password? Click here to reset