A Closer Look at Few-Shot Video Classification: A New Baseline and Benchmark

10/24/2021
by   Zhenxi Zhu, et al.
0

The existing few-shot video classification methods often employ a meta-learning paradigm by designing customized temporal alignment module for similarity calculation. While significant progress has been made, these methods fail to focus on learning effective representations, and heavily rely on the ImageNet pre-training, which might be unreasonable for the few-shot recognition setting due to semantics overlap. In this paper, we aim to present an in-depth study on few-shot video classification by making three contributions. First, we perform a consistent comparative study on the existing metric-based methods to figure out their limitations in representation learning. Accordingly, we propose a simple classifier-based baseline without any temporal alignment that surprisingly outperforms the state-of-the-art meta-learning based methods. Second, we discover that there is a high correlation between the novel action class and the ImageNet object class, which is problematic in the few-shot recognition setting. Our results show that the performance of training from scratch drops significantly, which implies that the existing benchmarks cannot provide enough base data. Finally, we present a new benchmark with more base data to facilitate future few-shot video classification without pre-training. The code will be made available at https://github.com/MCG-NJU/FSL-Video.

READ FULL TEXT

page 1

page 5

page 6

research
03/09/2020

A New Meta-Baseline for Few-Shot Learning

Meta-learning has become a popular framework for few-shot learning in re...
research
09/11/2020

Meta Learning for Few-Shot One-class Classification

We propose a method that can perform one-class classification given only...
research
07/22/2022

Rethinking Few-Shot Object Detection on a Multi-Domain Benchmark

Most existing works on few-shot object detection (FSOD) focus on a setti...
research
10/23/2022

Few-Shot Meta Learning for Recognizing Facial Phenotypes of Genetic Disorders

Computer vision-based methods have valuable use cases in precision medic...
research
09/30/2021

Unsupervised Few-Shot Action Recognition via Action-Appearance Aligned Meta-Adaptation

We present MetaUVFS as the first Unsupervised Meta-learning algorithm fo...
research
08/06/2021

Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight Transformer

A few-shot semantic segmentation model is typically composed of a CNN en...
research
12/11/2019

Associative Alignment for Few-shot Image Classification

Few-shot image classification aims at training a model by using only a f...

Please sign up or login with your details

Forgot password? Click here to reset