Weakly Supervised Gaussian Networks for Action Detection

04/16/2019
by   Basura Fernando, et al.

Detecting the temporal extents of human actions in videos is a challenging computer vision problem that requires detailed manual supervision, including frame-level labels. This expensive annotation process limits the deployment of action detectors to a small number of categories. We propose a novel action recognition method, called WSGN, that learns to detect actions from weak supervision, using only video-level labels. WSGN learns to exploit both video-specific and dataset-wide statistics to predict the relevance of each frame to an action category. We show that combining the local and global channels leads to significant gains on two standard benchmarks, THUMOS14 and Charades. Our method improves on weakly supervised state-of-the-art methods by more than 12% mAP and is only 4% behind the state-of-the-art supervised method on the THUMOS14 dataset for action detection. Similarly, our method is only 0.3% mAP behind a state-of-the-art supervised method on the challenging Charades dataset for action localisation.


research
06/05/2020

WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos

Online action detection in untrimmed videos aims to identify an action a...
research
05/17/2018

NeuralNetwork-Viterbi: A Framework for Weakly Supervised Video Learning

Video learning is an important task in computer vision and has experienc...
research
11/24/2015

Fine-Grain Annotation of Cricket Videos

The recognition of human activities is one of the key problems in video ...
research
05/12/2022

Weakly-Supervised Action Detection Guided by Audio Narration

Videos are more well-organized curated data sources for visual concept l...
research
08/22/2019

3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization

Temporal action localization is a challenging computer vision problem wi...
research
09/30/2021

Deep Learning-based Action Detection in Untrimmed Videos: A Survey

Understanding human behavior and activity facilitates advancement of num...
research
12/03/2017

Multimodal Visual Concept Learning with Weakly Supervised Techniques

Despite the availability of a huge amount of video data accompanied by d...
