Weakly-Supervised Temporal Localization via Occurrence Count Learning

05/17/2019
by   Julien Schroeter, et al.
0

We propose a novel model for temporal detection and localization which allows the training of deep neural networks using only counts of event occurrences as training labels. This powerful weakly-supervised framework alleviates the burden of the imprecise and time-consuming process of annotating event locations in temporal data. Unlike existing methods, in which localization is explicitly achieved by design, our model learns localization implicitly as a byproduct of learning to count instances. This unique feature is a direct consequence of the model's theoretical properties. We validate the effectiveness of our approach in a number of experiments (drum hit and piano onset detection in audio, digit detection in images) and demonstrate performance comparable to that of fully-supervised state-of-the-art methods, despite much weaker training requirements.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/14/2017

C-WSL: Count-guided Weakly Supervised Localization

We introduce a count-guided weakly supervised localization (C-WSL) frame...
research
03/06/2021

Learning from Counting: Leveraging Temporal Classification for Weakly Supervised Object Localization and Detection

This paper reports a new solution of leveraging temporal classification ...
research
08/19/2023

Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling

Weakly-supervised action localization aims to recognize and localize act...
research
07/12/2023

Temporal Label-Refinement for Weakly-Supervised Audio-Visual Event Localization

Audio-Visual Event Localization (AVEL) is the task of temporally localiz...
research
07/28/2017

A Weakly Supervised Approach to Train Temporal Relation Classifiers and Acquire Regular Event Pairs Simultaneously

Capabilities of detecting temporal relations between two events can bene...
research
08/12/2016

Self-paced Learning for Weakly Supervised Evidence Discovery in Multimedia Event Search

Multimedia event detection has been receiving increasing attention in re...
research
06/05/2023

Inflated 3D Convolution-Transformer for Weakly-supervised Carotid Stenosis Grading with Ultrasound Videos

Localization of the narrowest position of the vessel and corresponding v...

Please sign up or login with your details

Forgot password? Click here to reset