Proposal-Free Temporal Action Detection via Global Segmentation Mask Learning

07/14/2022
by Sauradip Nag, et al.

Existing temporal action detection (TAD) methods rely on generating an overwhelmingly large number of proposals per video. Proposal generation and/or per-proposal evaluation of action instances lead to complex model designs and high computational cost. In this work, for the first time, we propose a proposal-free Temporal Action detection model with Global Segmentation mask (TAGS). Our core idea is to learn a global segmentation mask for each action instance jointly over the full video length. TAGS differs significantly from conventional proposal-based methods in that it focuses on global temporal representation learning to directly detect the local start and end points of action instances, without proposals. Further, by modeling TAD holistically rather than locally at the level of individual proposals, TAGS needs a much simpler model architecture with lower computational cost. Extensive experiments show that, despite its simpler design, TAGS outperforms existing TAD methods, achieving new state-of-the-art performance on two benchmarks. Importantly, it is about 20x faster to train and about 1.6x more efficient at inference. Our PyTorch implementation of TAGS is available at https://github.com/sauradip/TAGS.
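
To make the idea of reading start and end points directly off a temporal mask more concrete, here is a minimal sketch in plain Python. It is not the TAGS implementation: the function name mask_to_segments, the 0.5 threshold, and the example scores are all assumptions chosen for illustration. It only shows how a per-snippet foreground mask over the whole video can be converted into (start, end) segments without enumerating proposals.

```python
from typing import List, Sequence, Tuple


def mask_to_segments(
    mask: Sequence[float], threshold: float = 0.5
) -> List[Tuple[int, int]]:
    """Convert per-snippet foreground scores into (start, end) index pairs.

    Hypothetical helper for illustration only. `start` is inclusive and
    `end` is exclusive, both measured in snippet indices.
    """
    segments: List[Tuple[int, int]] = []
    start = None  # index where the current run of foreground snippets began
    for t, score in enumerate(mask):
        active = score >= threshold
        if active and start is None:
            start = t  # a run of foreground snippets begins here
        elif not active and start is not None:
            segments.append((start, t))  # the run ended just before t
            start = None
    if start is not None:
        # The last run extends to the end of the video.
        segments.append((start, len(mask)))
    return segments


# Made-up scores for a 9-snippet video containing two action instances.
example_mask = [0.1, 0.2, 0.9, 0.8, 0.7, 0.1, 0.0, 0.6, 0.9]
print(mask_to_segments(example_mask))  # [(2, 5), (7, 9)]
```

In this toy example the boundaries fall out of a single pass over one global mask, which is the intuition behind avoiding per-proposal evaluation; how TAGS actually predicts and decodes its masks is described in the full paper and repository.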


research
07/14/2022

Semi-Supervised Temporal Action Detection with Proposal-Free Masking

Existing temporal action detection (TAD) methods rely on a large number ...
research
10/17/2017

Single Shot Temporal Action Detection

Temporal action detection is a very important yet challenging problem, s...
research
09/18/2021

Towards High-Quality Temporal Action Detection with Sparse Proposals

Temporal Action Detection (TAD) is an essential and challenging topic in...
research
11/27/2022

Post-Processing Temporal Action Detection

Existing Temporal Action Detection (TAD) methods typically take a pre-pr...
research
03/27/2023

DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion

We propose a new formulation of temporal action detection (TAD) with den...
research
03/03/2022

SegTAD: Precise Temporal Action Detection via Semantic Segmentation

Temporal action detection (TAD) is an important yet challenging task in ...
research
03/06/2023

Faster Learning of Temporal Action Proposal via Sparse Multilevel Boundary Generator

Temporal action localization in videos presents significant challenges i...
