Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization

06/23/2022
by   Kun Xia, et al.

The main challenge of Temporal Action Localization is to retrieve subtle human actions from the co-occurring ingredients of an untrimmed video, e.g., context and background. While prior approaches have made substantial progress by devising advanced action detectors, they still suffer from these co-occurring ingredients, which often dominate the actual action content in videos. In this paper, we explore two orthogonal but complementary aspects of a video snippet: the action features and the co-occurrence features. Specifically, we develop a novel auxiliary task that decouples these two types of features within a video snippet and recombines them to generate a new feature representation with more salient action information for accurate action localization. We term our method RefactorNet, which first explicitly factorizes the action content and regularizes its co-occurrence features, and then synthesizes a new action-dominated video representation. Extensive experimental results and ablation studies on THUMOS14 and ActivityNet v1.3 demonstrate that our new representation, combined with a simple action detector, can significantly improve the action localization performance.
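The decouple-then-recombine idea can be illustrated with a toy sketch. The abstract does not specify the architecture, so everything below is an assumption made for illustration: the two linear branches, the fusion layer, the random weights, and the `cooc_scale` damping factor, which only stands in for whatever regularization the method actually applies to the co-occurrence features. All function and parameter names are hypothetical, and this is not the authors' implementation.

```python
import random


def make_weights(d_out, d_in, rng):
    """Build a small random weight matrix with d_out rows and d_in columns."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(d_in)] for _ in range(d_out)]


def linear(x, w):
    """Apply a weight matrix (list of rows) to a feature vector."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]


def refactor_snippet(x, w_action, w_cooc, w_fuse, cooc_scale=0.5):
    """Toy decouple-and-recombine step for one snippet feature vector.

    Assumption: two linear branches split the snippet feature into an
    "action" part and a "co-occurrence" part; the co-occurrence part is
    damped (a stand-in for regularization) before both are fused.
    """
    action = linear(x, w_action)            # hypothetical action branch
    cooc = linear(x, w_cooc)                # hypothetical co-occurrence branch
    damped = [cooc_scale * c for c in cooc]  # weaken co-occurrence content
    fused = linear(action + damped, w_fuse)  # concatenate, then fuse
    return action, cooc, fused


if __name__ == "__main__":
    rng = random.Random(0)
    d_in, d_half = 8, 4
    x = [rng.random() for _ in range(d_in)]
    w_a = make_weights(d_half, d_in, rng)
    w_c = make_weights(d_half, d_in, rng)
    w_f = make_weights(d_in, 2 * d_half, rng)
    action, cooc, fused = refactor_snippet(x, w_a, w_c, w_f)
    print(len(action), len(cooc), len(fused))
```

In this sketch, setting `cooc_scale=0.0` makes the fused feature depend only on the action branch, while `cooc_scale=1.0` passes the co-occurrence branch through undamped. A trained model would learn the branches under an auxiliary objective rather than use random weights.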


