TNT: Text-Conditioned Network with Transductive Inference for Few-Shot Video Classification

06/21/2021
by   Andrés Villa, et al.
0

Recently, few-shot learning has received increasing interest. Existing efforts have been focused on image classification, with very few attempts dedicated to the more challenging few-shot video classification problem. These few attempts aim to effectively exploit the temporal dimension in videos for better learning in low data regimes. However, they have largely ignored a key characteristic of video which could be vital for few-shot recognition, that is, videos are often accompanied by rich text descriptions. In this paper, for the first time, we propose to leverage these human-provided textual descriptions as privileged information when training a few-shot video classification model. Specifically, we formulate a text-based task conditioner to adapt video features to the few-shot learning task. Our model follows a transductive setting where query samples and support textual descriptions can be used to update the support set class prototype to further improve the task-adaptation ability of the model. Our model obtains state-of-the-art performance on four challenging benchmarks in few-shot video action classification.

READ FULL TEXT

page 1

page 2

page 4

page 5

page 6

page 7

page 9

page 10

research
03/06/2023

CLIP-guided Prototype Modulating for Few-shot Action Recognition

Learning from large-scale contrastive language-image pre-training like C...
research
07/09/2020

Generalized Many-Way Few-Shot Video Classification

Few-shot learning methods operate in low data regimes. The aim is to lea...
research
09/14/2019

Metric-Based Few-Shot Learning for Video Action Recognition

In the few-shot scenario, a learner must effectively generalize to unsee...
research
05/30/2022

Task-Prior Conditional Variational Auto-Encoder for Few-Shot Image Classification

Transductive methods always outperform inductive methods in few-shot ima...
research
08/04/2022

TIC: Text-Guided Image Colorization

Image colorization is a well-known problem in computer vision. However, ...
research
04/19/2022

Less than Few: Self-Shot Video Instance Segmentation

The goal of this paper is to bypass the need for labelled examples in fe...
research
05/17/2022

Uncertainty-based Network for Few-shot Image Classification

The transductive inference is an effective technique in the few-shot lea...

Please sign up or login with your details

Forgot password? Click here to reset