A Multi-Task Learning Approach for Human Action Detection and Ergonomics Risk Assessment

08/07/2020
by   Behnoosh Parsa, et al.
0

We propose a new approach to Human Action Evaluation (HAE) in long videos using graph-based multi-task modeling. Previous works in activity assessment either directly compute a metric using a detected skeleton or use the scene information to regress the activity score. These approaches are insufficient for accurate activity assessment since they only compute an average score over a clip, and do not consider the correlation between the joints and body dynamics. Moreover, they are highly scene-dependent which makes the generalizability of these methods questionable. We propose a novel multi-task framework for HAE that utilizes a Graph Convolutional Network backbone to embed the interconnection between human joints in the features. In this framework, we solve the Human Action Detection (HAD) problem as an auxiliary task to improve activity assessment. The HAD head is powered by an Encoder-Decoder Temporal Convolutional Network to detect activities in long videos and HAE uses a Long-Short-Term-Memory-based architecture. We evaluate our method on the UW-IOM and TUM Kitchen datasets and discuss the success and failure cases on these two datasets.

READ FULL TEXT

page 5

page 7

page 8

research
09/13/2018

Part-based Graph Convolutional Network for Action Recognition

Human actions comprise of joint motion of articulated body parts or `ges...
research
08/10/2023

Local-Global Information Interaction Debiasing for Dynamic Scene Graph Generation

The task of dynamic scene graph generation (DynSGG) aims to generate sce...
research
05/28/2019

Autonomous Human Activity Classification from Ego-vision Camera and Accelerometer Data

There has been significant amount of research work on human activity cla...
research
11/26/2018

Stacked Spatio-Temporal Graph Convolutional Networks for Action Segmentation

We propose novel Stacked Spatio-Temporal Graph Convolutional Networks (S...
research
08/13/2020

Hybrid Dynamic-static Context-aware Attention Network for Action Assessment in Long Videos

The objective of action quality assessment is to score sports videos. Ho...
research
06/17/2019

Multi-task Learning For Detecting and Segmenting Manipulated Facial Images and Videos

Detecting manipulated images and videos is an important topic in digital...

Please sign up or login with your details

Forgot password? Click here to reset