Few-Shot Transformation of Common Actions into Time and Space

04/06/2021
by   Pengwan Yang, et al.
11

This paper introduces the task of few-shot common action localization in time and space. Given a few trimmed support videos containing the same but unknown action, we strive for spatio-temporal localization of that action in a long untrimmed query video. We do not require any class labels, interval bounds, or bounding boxes. To address this challenging task, we introduce a novel few-shot transformer architecture with a dedicated encoder-decoder structure optimized for joint commonality learning and localization prediction, without the need for proposals. Experiments on our reorganizations of the AVA and UCF101-24 datasets show the effectiveness of our approach for few-shot common action localization, even when the support videos are noisy. Although we are not specifically designed for common localization in time only, we also compare favorably against the few-shot and one-shot state-of-the-art in this setting. Lastly, we demonstrate that the few-shot transformer is easily extended to common action localization per pixel.

READ FULL TEXT

page 6

page 7

page 11

page 12

research
08/13/2020

Localizing the Common Action Among a Few Videos

This paper strives to localize the temporal extent of an action in a lon...
research
04/26/2016

Spot On: Action Localization from Pointly-Supervised Proposals

We strive for spatio-temporal localization of actions in videos. The sta...
research
10/20/2021

Few-Shot Temporal Action Localization with Query Adaptive Transformer

Existing temporal action localization (TAL) works rely on a large number...
research
05/29/2018

Pointly-Supervised Action Localization

This paper strives for spatio-temporal localization of human actions in ...
research
12/17/2020

Multi-shot Temporal Event Localization: a Benchmark

Current developments in temporal event or action localization usually ta...
research
07/19/2017

Detecting Parts for Action Localization

In this paper, we propose a new framework for action localization that t...
research
03/21/2023

Multi-modal Prompting for Low-Shot Temporal Action Localization

In this paper, we consider the problem of temporal action localization u...

Please sign up or login with your details

Forgot password? Click here to reset