Delving into 3D Action Anticipation from Streaming Videos

06/15/2019
by   Hongsong Wang, et al.
0

Action anticipation, which aims to recognize the action with a partial observation, becomes increasingly popular due to a wide range of applications. In this paper, we investigate the problem of 3D action anticipation from streaming videos with the target of understanding best practices for solving this problem. We first introduce several complementary evaluation metrics and present a basic model based on frame-wise action classification. To achieve better performance, we then investigate two important factors, i.e., the length of the training clip and clip sampling method. We also explore multi-task learning strategies by incorporating auxiliary information from two aspects: the full action representation and the class-agnostic action label. Our comprehensive experiments uncover the best practices for 3D action anticipation, and accordingly we propose a novel method with a multi-task loss. The proposed method considerably outperforms the recent methods and exhibits the state-of-the-art performance on standard benchmarks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/06/2020

Online Action Detection in Streaming Videos with Time Buffers

We formulate the problem of online temporal action detection in live str...
research
02/19/2018

Online Action Detection in Untrimmed, Streaming Videos - Modeling and Evaluation

The goal of Online Action Detection (OAD) is to detect action in a timel...
research
12/22/2016

Efficient Action Detection in Untrimmed Videos via Multi-Task Learning

This paper studies the joint learning of action recognition and temporal...
research
04/27/2022

Human-Centered Prior-Guided and Task-Dependent Multi-Task Representation Learning for Action Recognition Pre-Training

Recently, much progress has been made for self-supervised action recogni...
research
01/23/2022

ASCNet: Action Semantic Consistent Learning of Arbitrary Progress Levels for Early Action Prediction

Early action prediction aims to recognize human actions from only a part...
research
02/01/2022

Multi-Order Networks for Action Unit Detection

Deep multi-task methods, where several tasks are learned within a single...
research
09/10/2021

PlaTe: Visually-Grounded Planning with Transformers in Procedural Tasks

In this work, we study the problem of how to leverage instructional vide...

Please sign up or login with your details

Forgot password? Click here to reset