Connectionist Temporal Modeling for Weakly Supervised Action Labeling

07/28/2016
by   De-An Huang, et al.
0

We propose a weakly-supervised framework for action labeling in video, where only the order of occurring actions is required during training time. The key challenge is that the per-frame alignments between the input (video) and label (action) sequences are unknown during training. We address this by introducing the Extended Connectionist Temporal Classification (ECTC) framework to efficiently evaluate all possible alignments via dynamic programming and explicitly enforce their consistency with frame-to-frame visual similarities. This protects the model from distractions of visually inconsistent or degenerated alignments without the need of temporal supervision. We further extend our framework to the semi-supervised case when a few frames are sparsely annotated in a video. With less than 1 is able to outperform existing semi-supervised approaches and achieve comparable performance to that of fully supervised approaches.

READ FULL TEXT

page 11

page 12

page 14

research
06/02/2017

Temporal Action Labeling using Action Sets

Action detection and temporal segmentation of actions in videos are topi...
research
03/31/2020

SCT: Set Constrained Temporal Transformer for Set Supervised Action Segmentation

Temporal action segmentation is a topic of increasing interest, however,...
research
10/07/2016

Weakly supervised learning of actions from transcripts

We present an approach for weakly supervised learning of human actions f...
research
01/30/2019

Effective weakly supervised semantic frame induction using expression sharing in hierarchical hidden Markov models

We present a framework for the induction of semantic frames from utteran...
research
12/02/2021

Iterative Frame-Level Representation Learning And Classification For Semi-Supervised Temporal Action Segmentation

Temporal action segmentation classifies the action of each frame in (lon...
research
07/04/2014

Weakly Supervised Action Labeling in Videos Under Ordering Constraints

We are given a set of video clips, each one annotated with an ordered l...
research
03/27/2020

Weakly-Supervised Action Localization by Generative Attention Modeling

Weakly-supervised temporal action localization is a problem of learning ...

Please sign up or login with your details

Forgot password? Click here to reset