Few-shot Action Recognition via Improved Attention with Self-supervision

01/12/2020
by   Hongguang Zhang, et al.
12

Most existing few-shot learning methods in computer vision focus on class recognition given a few of still images as the input. In contrast, this paper tackles a more challenging task of few-shot action-recognition from video clips. We propose a simple framework which is both flexible and easy to implement. Our approach exploits joint spatial and temporal attention mechanisms in conjunction with self-supervised representation learning on videos. This design encourages the model to discover and encode spatial and temporal attention hotspots important during the similarity learning between dynamic video sequences for which locations of discriminative patterns vary in the spatio-temporal sense. Our method compares favorably with several state-of-the-art baselines on HMDB51, miniMIT and UCF101 datasets, demonstrating its superior performance.

READ FULL TEXT

page 1

page 3

page 5

research
06/16/2018

Two Stream Self-Supervised Learning for Action Recognition

We present a self-supervised approach using spatio-temporal signals betw...
research
11/18/2021

M2A: Motion Aware Attention for Accurate Video Action Recognition

Advancements in attention mechanisms have led to significant performance...
research
04/19/2022

Less than Few: Self-Shot Video Instance Segmentation

The goal of this paper is to bypass the need for labelled examples in fe...
research
11/17/2020

Semi-Supervised Few-Shot Atomic Action Recognition

Despite excellent progress has been made, the performance on action reco...
research
12/02/2021

Stacked Temporal Attention: Improving First-person Action Recognition by Emphasizing Discriminative Clips

First-person action recognition is a challenging task in video understan...
research
08/27/2016

Spatio-temporal Aware Non-negative Component Representation for Action Recognition

This paper presents a novel mid-level representation for action recognit...
research
03/18/2021

CLTA: Contents and Length-based Temporal Attention for Few-shot Action Recognition

Few-shot action recognition has attracted increasing attention due to th...

Please sign up or login with your details

Forgot password? Click here to reset