LAC: Latent Action Composition for Skeleton-based Action Segmentation

08/28/2023
by   Di Yang, et al.
0

Skeleton-based action segmentation requires recognizing composable actions in untrimmed videos. Current approaches decouple this problem by first extracting local visual features from skeleton sequences and then processing them by a temporal model to classify frame-wise actions. However, their performances remain limited as the visual features cannot sufficiently express composable actions. In this context, we propose Latent Action Composition (LAC), a novel self-supervised framework aiming at learning from synthesized composable motions for skeleton-based action segmentation. LAC is composed of a novel generation module towards synthesizing new sequences. Specifically, we design a linear latent space in the generator to represent primitive motion. New composed motions can be synthesized by simply performing arithmetic operations on latent representations of multiple input skeleton sequences. LAC leverages such synthesized sequences, which have large diversity and complexity, for learning visual representations of skeletons in both sequence and frame spaces via contrastive learning. The resulting visual encoder has a high expressive power and can be effectively transferred onto action segmentation tasks by end-to-end fine-tuning without the need for additional temporal models. We conduct a study focusing on transfer-learning and we show that representations learned from pre-trained LAC outperform the state-of-the-art by a large margin on TSU, Charades, PKU-MMD datasets.

READ FULL TEXT
research
08/31/2022

ViA: View-invariant Skeleton Action Representation Learning via Motion Retargeting

Current self-supervised approaches for skeleton action representation le...
research
04/18/2023

Self-Supervised 3D Action Representation Learning with Skeleton Cloud Colorization

3D Skeleton-based human action recognition has attracted increasing atte...
research
07/20/2022

An Efficient Framework for Few-shot Skeleton-based Temporal Action Segmentation

Temporal action segmentation (TAS) aims to classify and locate actions i...
research
09/12/2023

Action Segmentation Using 2D Skeleton Heatmaps

This paper presents a 2D skeleton-based action segmentation method with ...
research
12/03/2020

Sparse Semi-Supervised Action Recognition with Active Learning

Current state-of-the-art methods for skeleton-based action recognition a...
research
05/12/2020

Skeleton-Aware Networks for Deep Motion Retargeting

We introduce a novel deep learning framework for data-driven motion reta...
research
07/07/2023

Language-free Compositional Action Generation via Decoupling Refinement

Composing simple elements into complex concepts is crucial yet challengi...

Please sign up or login with your details

Forgot password? Click here to reset