DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization

07/31/2023
by   Xiaojun Tang, et al.
0

Weakly-supervised temporal action localization (WTAL) is a practical yet challenging task. Due to large-scale datasets, most existing methods use a network pretrained in other datasets to extract features, which are not suitable enough for WTAL. To address this problem, researchers design several modules for feature enhancement, which improve the performance of the localization module, especially modeling the temporal relationship between snippets. However, all of them neglect the adverse effects of ambiguous information, which would reduce the discriminability of others. Considering this phenomenon, we propose Discriminability-Driven Graph Network (DDG-Net), which explicitly models ambiguous snippets and discriminative snippets with well-designed connections, preventing the transmission of ambiguous information and enhancing the discriminability of snippet-level representations. Additionally, we propose feature consistency loss to prevent the assimilation of features and drive the graph convolution network to generate more discriminative representations. Extensive experiments on THUMOS14 and ActivityNet1.2 benchmarks demonstrate the effectiveness of DDG-Net, establishing new state-of-the-art results on both datasets. Source code is available at <https://github.com/XiaojunTang22/ICCV2023-DDGNet>.

READ FULL TEXT

page 1

page 4

page 7

page 8

research
08/22/2019

3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization

Temporal action localization is a challenging computer vision problem wi...
research
12/21/2021

ACGNet: Action Complement Graph Network for Weakly-supervised Temporal Action Localization

Weakly-supervised temporal action localization (WTAL) in untrimmed video...
research
07/19/2020

Geometry Constrained Weakly Supervised Object Localization

We propose a geometry constrained network, termed GC-Net, for weakly sup...
research
08/14/2021

Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization

As a challenging task of high-level video understanding, weakly supervis...
research
03/31/2022

Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization

We target at the task of weakly-supervised action localization (WSAL), w...
research
07/27/2021

Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization

Weakly supervised temporal action localization (WS-TAL) is a challenging...
research
12/14/2022

Shared Coupling-bridge for Weakly Supervised Local Feature Learning

Sparse local feature extraction is usually believed to be of important s...

Please sign up or login with your details

Forgot password? Click here to reset