3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization

08/22/2019
by   Sanath Narayan, et al.
0

Temporal action localization is a challenging computer vision problem with numerous real-world applications. Most existing methods require laborious frame-level supervision to train action localization models. In this work, we propose a framework, called 3C-Net, which only requires video-level supervision (weak supervision) in the form of action category labels and the corresponding count. We introduce a novel formulation to learn discriminative action features with enhanced localization capabilities. Our joint formulation has three terms: a classification term to ensure the separability of learned action features, an adapted multi-label center loss term to enhance the action feature discriminability and a counting loss term to delineate adjacent action sequences, leading to improved localization. Comprehensive experiments are performed on two challenging benchmarks: THUMOS14 and ActivityNet 1.2. Our approach sets a new state-of-the-art for weakly-supervised temporal action localization on both datasets. On the THUMOS14 dataset, the proposed method achieves an absolute gain of 4.6 compared to the state-of-the-art. Source code is available at https://github.com/naraysa/3c-net.

READ FULL TEXT

page 2

page 8

research
07/31/2023

DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization

Weakly-supervised temporal action localization (WTAL) is a practical yet...
research
04/16/2019

Weakly Supervised Gaussian Networks for Action Detection

Detecting temporal extents of human actions in videos is a challenging c...
research
06/22/2022

Weakly-supervised Action Localization via Hierarchical Mining

Weakly-supervised action localization aims to localize and classify acti...
research
02/18/2020

Constraining Temporal Relationship for Action Localization

Recently, temporal action localization (TAL), i.e., finding specific act...
research
07/19/2020

Geometry Constrained Weakly Supervised Object Localization

We propose a geometry constrained network, termed GC-Net, for weakly sup...
research
12/13/2022

Dilation-Erosion for Single-Frame Supervised Temporal Action Localization

To balance the annotation labor and the granularity of supervision, sing...
research
11/15/2021

Weakly-Supervised Dense Action Anticipation

Dense anticipation aims to forecast future actions and their durations f...

Please sign up or login with your details

Forgot password? Click here to reset