On Evaluating Weakly Supervised Action Segmentation Methods

05/19/2020
by   Yaser Souri, et al.

Action segmentation is the task of temporally segmenting every frame of an untrimmed video. Weakly supervised approaches to action segmentation, especially those that learn from transcripts, have been of considerable interest to the computer vision community. In this work, we focus on two aspects of the use and evaluation of weakly supervised action segmentation approaches that are often overlooked: the performance variance over multiple training runs and the impact of selecting feature extractors for this task. To tackle the first problem, we train each method on the Breakfast dataset 5 times and provide the average and standard deviation of the results. Our experiments show that the standard deviation over these repetitions is between 1 and 2.5 and affects the comparison between different approaches. Furthermore, our investigation of feature extraction shows that, for the studied weakly supervised action segmentation methods, higher-level I3D features perform worse than classical IDT features.
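The reporting protocol described in the abstract (several training runs per method, summarized by average and standard deviation) can be illustrated with a minimal sketch. The function names and the score lists below are hypothetical placeholders chosen for illustration, not results or code from the paper, and the "gap larger than the spread" check is only one simple assumption about how such a comparison might be read.

```python
from statistics import mean, stdev


def summarize_runs(scores):
    """Return (mean, sample standard deviation) of per-run scores."""
    return mean(scores), stdev(scores)


def gap_exceeds_spread(scores_a, scores_b):
    """Return True if the gap between the two means is larger than
    the larger of the two standard deviations (a crude heuristic)."""
    mean_a, std_a = summarize_runs(scores_a)
    mean_b, std_b = summarize_runs(scores_b)
    return abs(mean_a - mean_b) > max(std_a, std_b)


# Placeholder accuracies for 5 runs of two hypothetical methods.
# These numbers are made up for illustration only.
method_a = [40.0, 42.0, 41.0, 43.0, 39.0]
method_b = [41.0, 40.0, 43.0, 42.0, 44.0]

for name, scores in (("A", method_a), ("B", method_b)):
    avg, std = summarize_runs(scores)
    print(f"method {name}: {avg:.2f} +/- {std:.2f}")

print("gap exceeds spread:", gap_exceeds_spread(method_a, method_b))
```

With these placeholder values, the gap between the two means is smaller than the run-to-run spread, which is the kind of situation in which a comparison based on a single run could be misleading.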

research
04/05/2019

Weakly Supervised Action Segmentation Using Mutual Consistency

Action segmentation is the task of predicting the actions in each frame ...
research
03/11/2021

Temporal Action Segmentation from Timestamp Supervision

Temporal action segmentation approaches have been very successful recent...
research
03/29/2020

Learning a Weakly-Supervised Video Actor-Action Segmentation Model with a Wise Selection

We address weakly-supervised video actor-action segmentation (VAAS), whi...
research
12/05/2017

Learning Pain from Action Unit Combinations: A Weakly Supervised Approach via Multiple Instance Learning

Facial pain expression is an important modality for assessing pain, espe...
research
05/07/2020

Learning to Segment Actions from Observation and Narration

We apply a generative segmental model of task structure, guided by narra...
research
05/17/2018

NeuralNetwork-Viterbi: A Framework for Weakly Supervised Video Learning

Video learning is an important task in computer vision and has experienc...
research
08/09/2021

FIFA: Fast Inference Approximation for Action Segmentation

We introduce FIFA, a fast approximate inference method for action segmen...
