Weakly-Supervised Online Action Segmentation in Multi-View Instructional Videos

03/24/2022
by Reza Ghoddoosian, et al.

This paper addresses a new problem: weakly-supervised online action segmentation in instructional videos. We present a framework that segments streaming videos online at test time using Dynamic Programming, and we show its advantages over a greedy sliding-window approach. We improve the framework by introducing the Online-Offline Discrepancy Loss (OODL), which encourages segmentation results with higher temporal consistency. Furthermore, during training only, we exploit frame-wise correspondence between multiple views as supervision for training on weakly-labeled instructional videos. In particular, we investigate three different multi-view inference techniques to generate more accurate frame-wise pseudo ground-truth at no additional annotation cost. We present results and ablation studies on two benchmark multi-view datasets, Breakfast and IKEA ASM. Experimental results show the efficacy of the proposed methods, both qualitatively and quantitatively, in the two domains of cooking and assembly.
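To make the contrast between greedy and Dynamic Programming decoding concrete, here is a minimal Python sketch. It is not the paper's framework: the function names, the per-frame score lists, and the `switch_penalty` parameter are all assumptions made for illustration, and the sketch does not cover OODL or the multi-view pseudo ground-truth. It only shows how a running best-path score per action can smooth the label stream compared with picking the top-scoring action at each frame independently.

```python
from typing import List, Sequence


def greedy_labels(scores: Sequence[Sequence[float]]) -> List[int]:
    """Pick the highest-scoring action at each frame independently."""
    return [max(range(len(frame)), key=frame.__getitem__) for frame in scores]


def online_dp_labels(
    scores: Sequence[Sequence[float]], switch_penalty: float = 1.0
) -> List[int]:
    """Label each frame as it arrives using a running Viterbi-style update.

    best[c] holds the score of the best label path that ends in action c at
    the current frame. Changing action costs `switch_penalty` (a hypothetical
    stand-in for a transition or length model). Only past frames are used,
    so the decoder can run on a stream.
    """
    labels: List[int] = []
    best: List[float] = []
    for frame in scores:
        if not best:
            # First frame: no history, so path scores equal frame scores.
            best = list(frame)
        else:
            top = max(best)
            # Either stay in the same action or switch from the best path.
            best = [s + max(prev, top - switch_penalty)
                    for s, prev in zip(frame, best)]
        # Emit the endpoint of the current best path as the online label.
        labels.append(max(range(len(best)), key=best.__getitem__))
    return labels


# Hypothetical log-score stream for two actions; frame 2 is a noisy outlier.
scores = [[0.0, -2.0], [0.0, -2.0], [-1.0, -0.5],
          [0.0, -2.0], [-2.0, 0.0], [-2.0, 0.0]]
print(greedy_labels(scores))
print(online_dp_labels(scores))
```

On this toy stream the greedy decoder flips to the second action for the single noisy frame, while the Dynamic Programming decoder keeps the first action until the evidence for a change accumulates.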


research
04/05/2019

Weakly Supervised Action Segmentation Using Mutual Consistency

Action segmentation is the task of predicting the actions in each frame ...
research
06/05/2020

WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos

Online action detection in untrimmed videos aims to identify an action a...
research
02/27/2020

Set-Constrained Viterbi for Set-Supervised Action Segmentation

This paper is about weakly supervised action segmentation, where ground ...
research
10/12/2021

Hierarchical Modeling for Task Recognition and Action Segmentation in Weakly-Labeled Instructional Videos

This paper focuses on task recognition and action segmentation in weakly...
research
03/28/2018

Weakly-Supervised Action Segmentation with Iterative Soft Boundary Assignment

In this work, we address the task of weakly-supervised human action segm...
research
08/30/2022

A Circular Window-based Cascade Transformer for Online Action Detection

Online action detection aims at the accurate action prediction of the cu...
research
10/19/2022

Multi-view Tracking Using Weakly Supervised Human Motion Prediction

Multi-view approaches to people-tracking have the potential to better ha...
