RCL: Recurrent Continuous Localization for Temporal Action Detection

03/14/2022
by   Qiang Wang, et al.
0

Temporal representation is the cornerstone of modern action detection techniques. State-of-the-art methods mostly rely on a dense anchoring scheme, where anchors are sampled uniformly over the temporal domain with a discretized grid, and then regress the accurate boundaries. In this paper, we revisit this foundational stage and introduce Recurrent Continuous Localization (RCL), which learns a fully continuous anchoring representation. Specifically, the proposed representation builds upon an explicit model conditioned with video embeddings and temporal coordinates, which ensure the capability of detecting segments with arbitrary length. To optimize the continuous representation, we develop an effective scale-invariant sampling strategy and recurrently refine the prediction in subsequent iterations. Our continuous anchoring scheme is fully differentiable, allowing to be seamlessly integrated into existing detectors, e.g., BMN and G-TAD. Extensive experiments on two benchmarks demonstrate that our continuous representation steadily surpasses other discretized counterparts by  2 on ActivtiyNet v1.3, outperforming all existing single-model detectors.

READ FULL TEXT

page 3

page 8

research
10/18/2019

AFO-TAD: Anchor-free One-Stage Detector for Temporal Action Detection

Temporal action detection is a fundamental yet challenging task in video...
research
11/06/2018

BLP - Boundary Likelihood Pinpointing Networks for Accurate Temporal Action Localization

Despite tremendous progress achieved in temporal action detection, state...
research
11/29/2018

Discovering Spatio-Temporal Action Tubes

In this paper, we address the challenging problem of spatial and tempora...
research
06/23/2022

Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization

The main challenge of Temporal Action Localization is to retrieve subtle...
research
08/10/2017

Exploring Temporal Preservation Networks for Precise Temporal Action Localization

Temporal action localization is an important task of computer vision. Th...
research
04/16/2019

Decoupling Localization and Classification in Single Shot Temporal Action Detection

Video temporal action detection aims to temporally localize and recogniz...
research
07/27/2018

Diagnosing Error in Temporal Action Detectors

Despite the recent progress in video understanding and the continuous ra...

Please sign up or login with your details

Forgot password? Click here to reset