Hierarchical Explanations for Video Action Recognition

01/01/2023
by   Sadaf Gulshad, et al.
0

We propose Hierarchical ProtoPNet: an interpretable network that explains its reasoning process by considering the hierarchical relationship between classes. Different from previous methods that explain their reasoning process by dissecting the input image and finding the prototypical parts responsible for the classification, we propose to explain the reasoning process for video action classification by dissecting the input video frames on multiple levels of the class hierarchy. The explanations leverage the hierarchy to deal with uncertainty, akin to human reasoning: When we observe water and human activity, but no definitive action it can be recognized as the water sports parent class. Only after observing a person swimming can we definitively refine it to the swimming action. Experiments on ActivityNet and UCF-101 show performance improvements while providing multi-level explanations.

READ FULL TEXT

page 1

page 4

page 7

page 8

page 14

page 15

page 16

page 17

research
08/20/2020

Learning to Abstract and Predict Human Actions

Human activities are naturally structured as hierarchies unrolled over t...
research
12/13/2015

Action Recognition with Image Based CNN Features

Most of human actions consist of complex temporal compositions of more s...
research
07/20/2020

Hierarchical Contrastive Motion Learning for Video Action Recognition

One central question for video action recognition is how to model motion...
research
09/18/2019

Class Feature Pyramids for Video Explanation

Deep convolutional networks are widely used in video action recognition....
research
04/30/2019

Attentive Spatio-Temporal Representation Learning for Diving Classification

Competitive diving is a well recognized aquatic sport in which a person ...
research
08/18/2021

The Multi-Modal Video Reasoning and Analyzing Competition

In this paper, we introduce the Multi-Modal Video Reasoning and Analyzin...
research
04/06/2023

Therbligs in Action: Video Understanding through Motion Primitives

In this paper we introduce a rule-based, compositional, and hierarchical...

Please sign up or login with your details

Forgot password? Click here to reset