Temporal-Relational CrossTransformers for Few-Shot Action Recognition

01/15/2021
by   Toby Perrett, et al.
0

We propose a novel approach to few-shot action recognition, finding temporally-corresponding frame tuples between the query and videos in the support set. Distinct from previous few-shot action recognition works, we construct class prototypes using the CrossTransformer attention mechanism to observe relevant sub-sequences of all support videos, rather than using class averages or single best matches. Video representations are formed from ordered tuples of varying numbers of frames, which allows sub-sequences of actions at different speeds and temporal offsets to be compared. Our proposed Temporal-Relational CrossTransformers achieve state-of-the-art results on both Kinetics and Something-Something V2 (SSv2), outperforming prior work on SSv2 by a wide margin (6.8 temporal relations. A detailed ablation showcases the importance of matching to multiple support set videos and learning higher-order relational CrossTransformers. Code is available at https://github.com/tobyperrett/trx

READ FULL TEXT

page 1

page 6

page 8

research
12/09/2021

Spatio-temporal Relation Modeling for Few-shot Action Recognition

We propose a novel few-shot action recognition framework, STRM, which en...
research
07/10/2021

TTAN: Two-Stage Temporal Alignment Network for Few-shot Action Recognition

Few-shot action recognition aims to recognize novel action classes (quer...
research
07/12/2022

Compound Prototype Matching for Few-shot Action Recognition

Few-shot action recognition aims to recognize novel action classes using...
research
07/05/2023

Task-Specific Alignment and Multiple Level Transformer for Few-Shot Action Recognition

In the research field of few-shot learning, the main difference between ...
research
04/21/2020

TAEN: Temporal Aware Embedding Network for Few-Shot Action Recognition

Classification of a new class entities requires collecting and annotatin...
research
12/13/2018

Dynamic Graph Modules for Modeling Higher-Order Interactions in Activity Recognition

Video action recognition, as a critical problem towards video understand...
research
04/03/2023

MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot Action Recognition

Current state-of-the-art approaches for few-shot action recognition achi...

Please sign up or login with your details

Forgot password? Click here to reset