Universal Prototype Transport for Zero-Shot Action Recognition and Localization

03/08/2022
by   Pascal Mettes, et al.
4

This work addresses the problem of recognizing action categories in videos for which no training examples are available. The current state-of-the-art enables such a zero-shot recognition by learning universal mappings from videos to a shared semantic space, either trained on large-scale seen actions or on objects. While effective, we find that universal action and object mappings are biased to their seen categories. Such biases are further amplified due to biases between seen and unseen categories in the semantic space. The compounding biases result in many unseen action categories simply never being selected during inference, hampering zero-shot progress. We seek to address this limitation and introduce universal prototype transport for zero-shot action recognition. The main idea is to re-position the semantic prototypes of unseen actions through transduction, i.e. by using the distribution of the unlabelled test set. For universal action models, we first seek to find a hyperspherical optimal transport mapping from unseen action prototypes to the set of all projected test videos. We then define a target prototype for each unseen action as the weighted Fréchet mean over the transport couplings. Equipped with a target prototype, we propose to re-position unseen action prototypes along the geodesic spanned by the original and target prototypes, acting as a form of semantic regularization. For universal object models, we outline a variant that defines target prototypes based on an optimal transport between unseen action prototypes and semantic object prototypes. Empirically, we show that universal prototype transport diminishes the biased selection of unseen action prototypes and boosts both universal action and object models, resulting in state-of-the-art performance for zero-shot classification and spatio-temporal localization.

READ FULL TEXT

page 3

page 13

research
10/26/2021

Zero-Shot Action Recognition from Diverse Object-Scene Compositions

This paper investigates the problem of zero-shot action recognition, in ...
research
04/18/2019

Out-of-Distribution Detection for Generalized Zero-Shot Action Recognition

Generalized zero-shot action recognition is a challenging problem, where...
research
03/22/2018

Towards Universal Representation for Unseen Action Recognition

Unseen Action Recognition (UAR) aims to recognise novel action categorie...
research
04/08/2021

Few-Shot Action Recognition with Compromised Metric via Optimal Transport

Although vital to computer vision systems, few-shot action recognition i...
research
04/10/2021

Object Priors for Classifying and Localizing Unseen Actions

This work strives for the classification and localization of human actio...
research
06/17/2022

Learning Using Privileged Information for Zero-Shot Action Recognition

Zero-Shot Action Recognition (ZSAR) aims to recognize video actions that...
research
05/28/2023

Z-GMOT: Zero-shot Generic Multiple Object Tracking

Despite the significant progress made in recent years, Multi-Object Trac...

Please sign up or login with your details

Forgot password? Click here to reset