Human Action Sequence Classification

10/07/2019
by   Yan Bin Ng, et al.
8

This paper classifies human action sequences from videos using a machine translation model. In contrast to classical human action classification which outputs a set of actions, our method output a sequence of action in the chronological order of the actions performed by the human. Therefore our method is evaluated using sequential performance measures such as Bilingual Evaluation Understudy (BLEU) scores. Action sequence classification has many applications such as learning from demonstration, action segmentation, detection, localization and video captioning. Furthermore, we use our model that is trained to output action sequences to solve downstream tasks; such as video captioning and action localization. We obtain state of the art results for video captioning in challenging Charades dataset obtaining BLEU-4 score of 34.8 and METEOR score of 33.6 outperforming previous state-of-the-art of 18.8 and 19.5 respectively. Similarly, on ActivityNet captioning, we obtain excellent results in-terms of ROUGE (20.24) and CIDER (37.58) scores. For action localization, without using any explicit start/end action annotations, our method obtains localization performance of 22.2 mAP outperforming prior fully supervised methods.

READ FULL TEXT

page 1

page 2

page 7

research
04/24/2018

Fine-grained Video Classification and Captioning

We describe a DNN for fine-grained action classification and video capti...
research
06/13/2017

Action Search: Learning to Search for Human Activities in Untrimmed Videos

Traditional approaches for action detection use trimmed data to learn so...
research
09/01/2022

Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation

This paper introduces a unified framework for video action segmentation ...
research
10/24/2019

LPAT: Learning to Predict Adaptive Threshold for Weakly-supervised Temporal Action Localization

Recently, Weakly-supervised Temporal Action Localization (WTAL) has been...
research
09/10/2019

Learning Actions from Human Demonstration Video for Robotic Manipulation

Learning actions from human demonstration is an emerging trend for desig...
research
04/07/2021

The Use of Video Captioning for Fostering Physical Activity

Video Captioning is considered to be one of the most challenging problem...
research
10/16/2019

Imperial College London Submission to VATEX Video Captioning Task

This paper describes the Imperial College London team's submission to th...

Please sign up or login with your details

Forgot password? Click here to reset