Theater Aid System for the Visually Impaired Through Transfer Learning of Spatio-Temporal Graph Convolution Networks

06/28/2023
by   Leyla Benhamida, et al.
0

The aim of this research is to recognize human actions performed on stage to aid visually impaired and blind individuals. To achieve this, we have created a theatre human action recognition system that uses skeleton data captured by depth image as input. We collected new samples of human actions in a theatre environment, and then tested the transfer learning technique with three pre-trained Spatio-Temporal Graph Convolution Networks for skeleton-based human action recognition: the spatio-temporal graph convolution network, the two-stream adaptive graph convolution network, and the multi-scale disentangled unified graph convolution network. We selected the NTU-RGBD human action benchmark as the source domain and used our collected dataset as the target domain. We analyzed the transferability of the pre-trained models and proposed two configurations to apply and adapt the transfer learning technique to the diversity between the source and target domains. The use of transfer learning helped to improve the performance of the human action system within the context of theatre. The results indicate that Spatio-Temporal Graph Convolution Networks is positively transferred, and there was an improvement in performance compared to the baseline without transfer learning.

READ FULL TEXT

page 9

page 13

research
04/23/2018

Memory Attention Networks for Skeleton-based Action Recognition

Skeleton-based action recognition task is entangled with complex spatio-...
research
10/14/2022

Trailers12k: Evaluating Transfer Learning for Movie Trailer Genre Classification

Transfer learning is a cornerstone for a wide range of computer vision p...
research
01/12/2022

Semantic Labeling of Human Action For Visually Impaired And Blind People Scene Interaction

The aim of this work is to contribute to the development of a tactile de...
research
03/15/2021

Improving Generalization of Transfer Learning Across Domains Using Spatio-Temporal Features in Autonomous Driving

Training vision-based autonomous driving in the real world can be ineffi...
research
05/31/2019

3DPalsyNet: A Facial Palsy Grading and Motion Recognition Framework using Fully 3D Convolutional Neural Networks

The capability to perform facial analysis from video sequences has signi...
research
02/27/2018

Spatio-Temporal Graph Convolution for Skeleton Based Action Recognition

Variations of human body skeletons may be considered as dynamic graphs, ...
research
07/22/2019

Domain-Specific Priors and Meta Learning for Low-shot First-Person Action Recognition

The lack of large-scale real datasets with annotationsmakes transfer lea...

Please sign up or login with your details

Forgot password? Click here to reset