Segregated Temporal Assembly Recurrent Networks for Weakly Supervised Multiple Action Detection

11/19/2018
by   Yunlu Xu, et al.
0

This paper proposes a segregated temporal assembly recurrent (STAR) network for weakly-supervised multiple action detection. The model learns from untrimmed videos with only supervision of video-level labels and makes prediction of intervals of multiple actions. Specifically, we first assemble video clips according to class labels by an attention mechanism that learns class-variable attention weights and thus helps the noise relieving from background or other actions. Secondly, we build temporal relationship between actions by feeding the assembled features into an enhanced recurrent neural network. Finally, we transform the output of recurrent neural network into the corresponding action distribution. In order to generate more precise temporal proposals, we design a score term called segregated temporal gradient-weighted class activation mapping (ST-GradCAM) fused with attention weights. Experiments on THUMOS'14 and ActivityNet1.3 datasets show that our approach outperforms the state-of-the-art weakly-supervised method, and performs at par with the fully-supervised counterparts.

READ FULL TEXT

page 1

page 4

page 7

research
12/14/2017

Weakly Supervised Action Localization by Sparse Temporal Pooling Network

We propose a weakly supervised temporal action localization algorithm on...
research
08/19/2019

Weakly-supervised Action Localization with Background Modeling

We describe a latent approach that learns to detect actions in long sequ...
research
06/02/2017

Temporal Action Labeling using Action Sets

Action detection and temporal segmentation of actions in videos are topi...
research
05/07/2023

Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization aims to identify and loca...
research
01/03/2021

A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization

Weakly supervised temporal action localization is a challenging vision t...
research
10/22/2019

Weakly-Supervised Completion Moment Detection using Temporal Attention

Monitoring the progression of an action towards completion offers fine g...
research
03/14/2016

Visual Concept Recognition and Localization via Iterative Introspection

Convolutional neural networks have been shown to develop internal repres...

Please sign up or login with your details

Forgot password? Click here to reset