Learning Implicit Temporal Alignment for Few-shot Video Classification

05/11/2021
by   Songyang Zhang, et al.
4

Few-shot video classification aims to learn new video categories with only a few labeled examples, alleviating the burden of costly annotation in real-world applications. However, it is particularly challenging to learn a class-invariant spatial-temporal representation in such a setting. To address this, we propose a novel matching-based few-shot learning strategy for video sequences in this work. Our main idea is to introduce an implicit temporal alignment for a video pair, capable of estimating the similarity between them in an accurate and robust manner. Moreover, we design an effective context encoding module to incorporate spatial and feature channel context, resulting in better modeling of intra-class variations. To train our model, we develop a multi-task loss for learning video matching, leading to video features with better generalization. Extensive experimental results on two challenging benchmarks, show that our method outperforms the prior arts with a sizable margin on SomethingSomething-V2 and competitive results on Kinetics.

READ FULL TEXT
research
07/26/2021

Temporal Alignment Prediction for Few-Shot Video Classification

The goal of few-shot video classification is to learn a classification m...
research
06/27/2019

Few-Shot Video Classification via Temporal Alignment

There is a growing interest in learning a model which could recognize no...
research
10/13/2020

Few-shot Action Recognition with Implicit Temporal Alignment and Pair Similarity Optimization

Few-shot learning aims to recognize instances from novel classes with fe...
research
08/14/2023

Orthogonal Temporal Interpolation for Zero-Shot Video Recognition

Zero-shot video recognition (ZSVR) is a task that aims to recognize vide...
research
07/13/2020

Part-aware Prototype Network for Few-shot Semantic Segmentation

Few-shot semantic segmentation aims to learn to segment new object class...
research
11/19/2019

Cross-Class Relevance Learning for Temporal Concept Localization

We present a novel Cross-Class Relevance Learning approach for the task ...
research
08/18/2023

Boosting Few-shot Action Recognition with Graph-guided Hybrid Matching

Class prototype construction and matching are core aspects of few-shot a...

Please sign up or login with your details

Forgot password? Click here to reset