Towards Robust Human Activity Recognition from RGB Video Stream with Limited Labeled Data

12/16/2018
by Krishanu Sarker, et al.

Human activity recognition from video streams has received considerable attention in recent years. Lacking depth information, RGB-video-based activity recognition performs poorly compared with RGB-D-based solutions. However, acquiring depth, inertial, and similar sensor data is costly and requires special equipment, whereas RGB video streams are available from ordinary cameras. Hence, our goal is to investigate whether similar or even higher accuracy can be achieved with the RGB-only modality. To this end, we propose a novel framework that couples skeleton data extracted from RGB video with a deep Bidirectional Long Short-Term Memory (BLSTM) model for activity recognition. A major challenge in training such a deep network is the limited training data, and relying on the RGB-only stream significantly exacerbates the difficulty. We therefore propose a set of algorithmic techniques to train this model effectively, e.g., data augmentation, dynamic frame dropout, and gradient injection. Experiments demonstrate that our RGB-only solution surpasses, by a notable margin, state-of-the-art approaches that all exploit RGB-D video streams. This makes our solution widely deployable with ordinary cameras.
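To make the idea of frame dropout on skeleton sequences more concrete, here is a minimal sketch in plain Python. It is not the authors' algorithm: the abstract does not say how frames are selected or what makes the dropout "dynamic", so this example simply drops each frame independently with a fixed probability. The function name `drop_frames`, the `drop_prob` parameter, and the flattened per-frame coordinate layout are all assumptions made for illustration.

```python
import random
from typing import List, Sequence

# Hypothetical layout: one frame is a flat list of joint coordinates.
Frame = List[float]


def drop_frames(sequence: Sequence[Frame], drop_prob: float,
                rng: random.Random) -> List[Frame]:
    """Return the frames of `sequence` that survive random dropping.

    Each frame is dropped independently with probability `drop_prob`
    (an assumed scheme, not taken from the paper). Frame order is
    preserved, and at least one frame is always kept.
    """
    if not 0.0 <= drop_prob < 1.0:
        raise ValueError("drop_prob must be in [0, 1)")
    if not sequence:
        return []
    kept = [frame for frame in sequence if rng.random() >= drop_prob]
    if not kept:
        # Guard against an empty clip: keep one randomly chosen frame.
        kept = [sequence[rng.randrange(len(sequence))]]
    return kept


# Usage: a toy 10-frame clip with two coordinates per frame.
rng = random.Random(0)
clip = [[float(t), float(t) + 0.5] for t in range(10)]
augmented = drop_frames(clip, drop_prob=0.3, rng=rng)
```

Applying a function like this with a fresh random draw each epoch would present the recurrent model with a slightly different temporal sampling of the same clip, which is one common way to stretch a small labeled set.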


research
04/29/2020

Skeleton Focused Human Activity Recognition in RGB Video

The data-driven approach that learns an optimal representation of vision...
research
07/09/2018

Human Activity Recognition in RGB-D Videos by Dynamic Images

Human Activity Recognition in RGB-D videos has been an active research t...
research
03/01/2022

Multi-stage RGB-based Transfer Learning Pipeline for Hand Activity Recognition

First-person hand activity recognition is a challenging task, especially...
research
08/25/2017

Batch-Based Activity Recognition from Egocentric Photo-Streams

Activity recognition from long unstructured egocentric photo-streams has...
research
12/11/2021

COMPOSER: Compositional Learning of Group Activity in Videos

Group Activity Recognition (GAR) detects the activity performed by a gro...
research
08/20/2021

Few Shot Activity Recognition Using Variational Inference

There has been a remarkable progress in learning a model which could rec...
research
12/07/2022

DroneAttention: Sparse Weighted Temporal Attention for Drone-Camera Based Activity Recognition

Human activity recognition (HAR) using drone-mounted cameras has attract...
