SegTAD: Precise Temporal Action Detection via Semantic Segmentation

03/03/2022
by   Chen Zhao, et al.
2

Temporal action detection (TAD) is an important yet challenging task in video analysis. Most existing works draw inspiration from image object detection and tend to reformulate it as a proposal generation - classification problem. However, there are two caveats with this paradigm. First, proposals are not equipped with annotated labels, which have to be empirically compiled, thus the information in the annotations is not necessarily precisely employed in the model training process. Second, there are large variations in the temporal scale of actions, and neglecting this fact may lead to deficient representation in the video features. To address these issues and precisely model temporal action detection, we formulate the task of temporal action detection in a novel perspective of semantic segmentation. Owing to the 1-dimensional property of TAD, we are able to convert the coarse-grained detection annotations to fine-grained semantic segmentation annotations for free. We take advantage of them to provide precise supervision so as to mitigate the impact induced by the imprecise proposal labels. We propose an end-to-end framework SegTAD composed of a 1D semantic segmentation network (1D-SSN) and a proposal detection network (PDN).

READ FULL TEXT

page 2

page 4

page 7

research
11/28/2018

Multi-granularity Generator for Temporal Action Proposal

Temporal action proposal generation is an important task, aiming to loca...
research
07/20/2022

Spotting Temporally Precise, Fine-Grained Events in Video

We introduce the task of spotting temporally precise, fine-grained event...
research
08/07/2021

DeepFH Segmentations for Superpixel-based Object Proposal Refinement

Class-agnostic object proposal generation is an important first step in ...
research
07/14/2022

Proposal-Free Temporal Action Detection via Global Segmentation Mask Learning

Existing temporal action detection (TAD) methods rely on generating an o...
research
05/24/2019

Implicit Label Augmentation on Partially Annotated Clips via Temporally-Adaptive Features Learning

Partially annotated clips contain rich temporal contexts that can comple...
research
06/20/2021

Proposal Relation Network for Temporal Action Detection

This technical report presents our solution for temporal action detectio...
research
09/25/2022

Hand Hygiene Assessment via Joint Step Segmentation and Key Action Scorer

Hand hygiene is a standard six-step hand-washing action proposed by the ...

Please sign up or login with your details

Forgot password? Click here to reset