Iterative Frame-Level Representation Learning And Classification For Semi-Supervised Temporal Action Segmentation

12/02/2021
by   Dipika Singhania, et al.

Temporal action segmentation classifies the action of each frame in (long) video sequences. Due to the high cost of frame-wise labeling, we propose the first semi-supervised method for temporal action segmentation. Our method hinges on unsupervised representation learning, which, for temporal action segmentation, poses unique challenges. Actions in untrimmed videos vary in length and have unknown labels and start/end times. The ordering of actions may also vary across videos. We propose a novel way to learn frame-wise representations from temporal convolutional networks (TCNs) by clustering input features with an added time-proximity condition and multi-resolution similarity. By merging representation learning with conventional supervised learning, we develop an "Iterative-Contrast-Classify (ICC)" semi-supervised learning scheme. With more labelled data, ICC progressively improves in performance; ICC semi-supervised learning, with 40% labelled videos, performs similarly to fully-supervised counterparts. Our ICC improves MoF by +1.8%, +5.6%, and +2.5% on Breakfast, 50Salads and GTEA respectively for 100% labelled videos.
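The idea of pairing frames by cluster membership plus time proximity can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes cluster ids have already been computed from the input features (for example with k-means), that timestamps are normalized to [0, 1] per video, and that a generic InfoNCE-style loss is used. The names `positive_pair_mask`, `contrastive_loss`, `max_gap`, and `temperature` are hypothetical, and the multi-resolution similarity mentioned in the abstract is left out.

```python
import numpy as np


def positive_pair_mask(cluster_ids, times, max_gap):
    """Return a (T, T) boolean mask of positive frame pairs.

    Assumption for illustration: frames i and j (i != j) are positives when
    they share a cluster id AND their normalized timestamps differ by at
    most `max_gap`.
    """
    cluster_ids = np.asarray(cluster_ids)
    times = np.asarray(times, dtype=float)
    same_cluster = cluster_ids[:, None] == cluster_ids[None, :]
    close_in_time = np.abs(times[:, None] - times[None, :]) <= max_gap
    mask = same_cluster & close_in_time
    np.fill_diagonal(mask, False)  # a frame is never its own positive
    return mask


def contrastive_loss(embeddings, pos_mask, temperature=0.1):
    """Generic InfoNCE-style loss over frame embeddings (a stand-in loss)."""
    z = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    sim = (z @ z.T) / temperature
    np.fill_diagonal(sim, -np.inf)  # exclude self-similarity from the softmax

    # Numerically stable log-softmax over all other frames.
    sim_max = sim.max(axis=1, keepdims=True)
    log_norm = np.log(np.exp(sim - sim_max).sum(axis=1, keepdims=True))
    log_prob = sim - sim_max - log_norm

    n_pos = pos_mask.sum(axis=1)
    has_pos = n_pos > 0
    if not has_pos.any():
        return 0.0
    # Average negative log-probability of the positives for each anchor.
    summed = np.where(pos_mask, log_prob, 0.0).sum(axis=1)
    return float(np.mean(-summed[has_pos] / n_pos[has_pos]))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    frames = rng.normal(size=(4, 8))      # stand-in for TCN frame embeddings
    clusters = [0, 0, 1, 0]               # assumed output of a clustering step
    stamps = [0.0, 0.1, 0.2, 0.9]         # normalized frame times
    mask = positive_pair_mask(clusters, stamps, max_gap=0.25)
    print(mask.astype(int))
    print(contrastive_loss(frames, mask))
```

In this toy input, frames 0 and 3 share a cluster but are far apart in time, so they are not treated as positives; only frames 0 and 1 are paired. That is the role the time-proximity condition plays in the sketch.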

research
12/20/2022

C2F-TCN: A Framework for Semi and Fully Supervised Temporal Action Segmentation

Temporal action segmentation tags action labels for every frame in an in...
research
07/18/2022

Leveraging Action Affinity and Continuity for Semi-supervised Temporal Action Segmentation

We present a semi-supervised learning approach to the temporal action se...
research
06/21/2016

Tagger: Deep Unsupervised Perceptual Grouping

We present a framework for efficient perceptual inference that explicitl...
research
07/28/2016

Connectionist Temporal Modeling for Weakly Supervised Action Labeling

We propose a weakly-supervised framework for action labeling in video, w...
research
10/03/2019

Learning Temporal Action Proposals With Fewer Labels

Temporal action proposals are a common module in action detection pipeli...
research
10/21/2019

Icentia11K: An Unsupervised Representation Learning Dataset for Arrhythmia Subtype Discovery

We release the largest public ECG dataset of continuous raw signals for ...
research
01/17/2020

GraphBGS: Background Subtraction via Recovery of Graph Signals

Graph-based algorithms have been successful approaching the problems of ...
