Leveraging triplet loss for unsupervised action segmentation

04/13/2023
by   E. Bueno-Benito, et al.
0

In this paper, we propose a novel fully unsupervised framework that learns action representations suitable for the action segmentation task from the single input video itself, without requiring any training data. Our method is a deep metric learning approach rooted in a shallow network with a triplet loss operating on similarity distributions and a novel triplet selection strategy that effectively models temporal and semantic priors to discover actions in the new representational space. Under these circumstances, we successfully recover temporal boundaries in the learned action representations with higher quality compared with existing unsupervised approaches. The proposed method is evaluated on two widely used benchmark datasets for the action segmentation task and it achieves competitive performance by applying a generic clustering algorithm on the learned representations.

READ FULL TEXT

page 1

page 3

page 4

page 7

research
12/20/2014

Deep metric learning using Triplet network

Deep learning has proven itself as a successful set of models for learni...
research
07/18/2022

Leveraging Action Affinity and Continuity for Semi-supervised Temporal Action Segmentation

We present a semi-supervised learning approach to the temporal action se...
research
03/15/2018

Temporal Human Action Segmentation via Dynamic Clustering

We present an effective dynamic clustering algorithm for the task of tem...
research
11/02/2020

Set Augmented Triplet Loss for Video Person Re-Identification

Modern video person re-identification (re-ID) machines are often trained...
research
03/19/2021

Improving Image co-segmentation via Deep Metric Learning

Deep Metric Learning (DML) is helpful in computer vision tasks. In this ...
research
03/09/2023

TAEC: Unsupervised Action Segmentation with Temporal-Aware Embedding and Clustering

Temporal action segmentation in untrimmed videos has gained increased at...
research
08/04/2018

Triplet Network with Attention for Speaker Diarization

In automatic speech processing systems, speaker diarization is a crucial...

Please sign up or login with your details

Forgot password? Click here to reset