Diffusion Action Segmentation

03/31/2023
by   Daochang Liu, et al.
0

Temporal action segmentation is crucial for understanding long-form videos. Previous works on this task commonly adopt an iterative refinement paradigm by using multi-stage models. Our paper proposes an essentially different framework via denoising diffusion models, which nonetheless shares the same inherent spirit of such iterative refinement. In this framework, action predictions are progressively generated from random noise with input video features as conditions. To enhance the modeling of three striking characteristics of human actions, including the position prior, the boundary ambiguity, and the relational dependency, we devise a unified masking strategy for the conditioning inputs in our framework. Extensive experiments on three benchmark datasets, i.e., GTEA, 50Salads, and Breakfast, are performed and the proposed method achieves superior or comparable results to state-of-the-art methods, showing the effectiveness of a generative approach for action segmentation. Our codes will be made available.

READ FULL TEXT

page 7

page 8

page 13

page 14

page 15

research
08/01/2023

Diffusion Model for Camouflaged Object Detection

Camouflaged object detection is a challenging task that aims to identify...
research
03/28/2018

Weakly-Supervised Action Segmentation with Iterative Soft Boundary Assignment

In this work, we address the task of weakly-supervised human action segm...
research
07/14/2020

Alleviating Over-segmentation Errors by Detecting Action Boundaries

We propose an effective framework for the temporal action segmentation t...
research
03/27/2023

DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion

We propose a new formulation of temporal action detection (TAD) with den...
research
04/04/2023

DIR-AS: Decoupling Individual Identification and Temporal Reasoning for Action Segmentation

Fully supervised action segmentation works on frame-wise action recognit...
research
05/29/2023

CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion Models

Camouflaged Object Detection (COD) is a challenging task in computer vis...
research
04/21/2023

Don't worry about mistakes! Glass Segmentation Network via Mistake Correction

Recall one time when we were in an unfamiliar mall. We might mistakenly ...

Please sign up or login with your details

Forgot password? Click here to reset