Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization

08/14/2021
by   Linjiang Huang, et al.
0

As a challenging task of high-level video understanding, weakly supervised temporal action localization has been attracting increasing attention. With only video annotations, most existing methods seek to handle this task with a localization-by-classification framework, which generally adopts a selector to select snippets of high probabilities of actions or namely the foreground. Nevertheless, the existing foreground selection strategies have a major limitation of only considering the unilateral relation from foreground to actions, which cannot guarantee the foreground-action consistency. In this paper, we present a framework named FAC-Net based on the I3D backbone, on which three branches are appended, named class-wise foreground classification branch, class-agnostic attention branch and multiple instance learning branch. First, our class-wise foreground classification branch regularizes the relation between actions and foreground to maximize the foreground-background separation. Besides, the class-agnostic attention branch and multiple instance learning branch are adopted to regularize the foreground-action consistency and help to learn a meaningful foreground classifier. Within each branch, we introduce a hybrid attention mechanism, which calculates multiple attention scores for each snippet, to focus on both discriminative and less-discriminative snippets to capture the full action boundaries. Experimental results on THUMOS14 and ActivityNet1.3 demonstrate the state-of-the-art performance of our method. Our code is available at https://github.com/LeonHLJ/FAC-Net.

READ FULL TEXT

page 3

page 7

page 8

research
03/28/2021

ACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localization

The object of Weakly-supervised Temporal Action Localization (WS-TAL) is...
research
01/03/2021

A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization

Weakly supervised temporal action localization is a challenging vision t...
research
06/22/2022

Weakly-supervised Action Localization via Hierarchical Mining

Weakly-supervised action localization aims to localize and classify acti...
research
04/07/2021

ACM-Net: Action Context Modeling Network for Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization aims to localize action i...
research
05/10/2021

Action Shuffling for Weakly Supervised Temporal Localization

Weakly supervised action localization is a challenging task with extensi...
research
07/31/2023

DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization

Weakly-supervised temporal action localization (WTAL) is a practical yet...
research
05/01/2022

Convex Combination Consistency between Neighbors for Weakly-supervised Action Localization

In weakly-supervised temporal action localization (WS-TAL), the methods ...

Please sign up or login with your details

Forgot password? Click here to reset