Timestamp-Supervised Action Segmentation in the Perspective of Clustering

12/22/2022
by   Dazhao Du, et al.
0

Video action segmentation aims to slice the video into several action segments. Recently, timestamp supervision has received much attention due to lower annotation costs. We find the frames near the boundaries of action segments are in the transition region between two consecutive actions and have unclear semantics, which we call ambiguous intervals. Most existing methods iteratively generate pseudo-labels for all frames in each video to train the segmentation model. However, ambiguous intervals are more likely to be assigned with noisy and incorrect pseudo-labels, which leads to performance degradation. We propose a novel framework to train the model under timestamp supervision including the following two parts. First, pseudo-label ensembling generates pseudo-label sequences with ambiguous intervals, where the frames have no pseudo-labels. Second, iterative clustering iteratively propagates the pseudo-labels to the ambiguous intervals by clustering, and thus updates the pseudo-label sequences to train the model. We further introduce a clustering loss, which encourages the features of frames within the same action segment more compact. Extensive experiments show the effectiveness of our method.

READ FULL TEXT

page 1

page 6

research
02/16/2022

Less is More: Surgical Phase Recognition from Timestamp Supervision

Surgical phase recognition is a fundamental task in computer-assisted su...
research
07/02/2022

Turning to a Teacher for Timestamp Supervised Temporal Action Segmentation

Temporal action segmentation in videos has drawn much attention recently...
research
03/04/2023

Improving Audio-Visual Video Parsing with Pseudo Visual Labels

Audio-Visual Video Parsing is a task to predict the events that occur in...
research
10/20/2021

Simpler Does It: Generating Semantic Labels with Objectness Guidance

Existing weakly or semi-supervised semantic segmentation methods utilize...
research
07/20/2022

A Generalized Robust Framework For Timestamp Supervision in Temporal Action Segmentation

In temporal action segmentation, Timestamp supervision requires only a h...
research
04/05/2021

Anchor-Constrained Viterbi for Set-Supervised Action Segmentation

This paper is about action segmentation under weak supervision in traini...
research
03/30/2023

Unsupervised Word Segmentation Using Temporal Gradient Pseudo-Labels

Unsupervised word segmentation in audio utterances is challenging as, in...

Please sign up or login with your details

Forgot password? Click here to reset