Therbligs in Action: Video Understanding through Motion Primitives

04/06/2023
by Eadom Dessalene, et al.

In this paper we introduce a rule-based, compositional, and hierarchical model of action that uses Therbligs as its atoms. These atoms provide a consistent, expressive, contact-centered representation of action. Over the atoms we introduce a differentiable method of rule-based reasoning to regularize for logical consistency. Our approach is complementary to other approaches in that the Therblig-based representations produced by our architecture augment rather than replace existing architectures' representations. We release the first Therblig-centered annotations over two popular video datasets: EPIC Kitchens 100 and 50 Salads. We also demonstrate the benefits of adopting Therblig representations through evaluation on three tasks: action segmentation, action anticipation, and action recognition, observing average relative improvements of 10.5%/7.53%/6.5%, respectively, over EPIC Kitchens and 8.9%/6.63%/4.8%, respectively, over 50 Salads. Code and data will be made publicly available.

research
08/03/2020

Late Temporal Modeling in 3D CNN Architectures with BERT for Action Recognition

In this work, we combine 3D convolution with late temporal modeling for ...
research
02/01/2021

Forecasting Action through Contact Representations from First Person Video

Human actions involving hand manipulations are structured according to t...
research
10/15/2015

A Novel Approach for Human Action Recognition from Silhouette Images

In this paper, a novel human action recognition technique from video is ...
research
01/17/2016

Face-space Action Recognition by Face-Object Interactions

Action recognition in still images has seen major improvement in recent ...
research
03/19/2018

Featureless: Bypassing feature extraction in action categorization

This method introduces an efficient manner of learning action categories...
research
07/06/2016

VideoLSTM Convolves, Attends and Flows for Action Recognition

We present a new architecture for end-to-end sequence learning of action...
research
01/01/2023

Hierarchical Explanations for Video Action Recognition

We propose Hierarchical ProtoPNet: an interpretable network that explain...
