Temporal Alignment Prediction for Few-Shot Video Classification

07/26/2021
by Fei Pan, et al.

The goal of few-shot video classification is to learn a classification model that generalizes well when trained with only a few labeled videos. However, it is difficult to learn discriminative feature representations for videos in such a setting. In this paper, we propose Temporal Alignment Prediction (TAP), a method based on sequence similarity learning for few-shot video classification. To obtain the similarity of a pair of videos, we use a temporal alignment prediction function to predict alignment scores between all pairs of temporal positions in the two videos. The inputs to this function are additionally enriched with context information from the temporal domain. We evaluate TAP on two video classification benchmarks, Kinetics and Something-Something V2. The experimental results verify the effectiveness of TAP and show that it outperforms state-of-the-art methods.
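
The idea of scoring all pairs of temporal positions and aggregating them into a video-level similarity can be illustrated with a small NumPy sketch. This is not the authors' implementation: the abstract does not specify the form of the alignment prediction function or how context is added, so the moving-average context step, the softmax over cosine similarities (standing in for the learned prediction function), and the names `add_temporal_context`, `alignment_scores`, `video_similarity`, `window`, and `temperature` are all assumptions made for illustration.

```python
import numpy as np


def add_temporal_context(frames, window=3):
    """Average each frame feature with its temporal neighbours.

    Hypothetical stand-in for the temporal context mentioned in the abstract.
    frames: array of shape (T, D), one feature vector per temporal position.
    """
    num_frames = frames.shape[0]
    half = window // 2
    out = np.empty_like(frames, dtype=float)
    for t in range(num_frames):
        lo, hi = max(0, t - half), min(num_frames, t + half + 1)
        out[t] = frames[lo:hi].mean(axis=0)
    return out


def alignment_scores(a, b, temperature=0.1, eps=1e-8):
    """Score every pair of temporal positions in two videos.

    Returns the (T_a, T_b) cosine-similarity matrix and a row-wise softmax
    over it. The softmax is an assumed, non-learned substitute for a
    trained alignment prediction function.
    """
    a_unit = a / (np.linalg.norm(a, axis=1, keepdims=True) + eps)
    b_unit = b / (np.linalg.norm(b, axis=1, keepdims=True) + eps)
    sim = a_unit @ b_unit.T
    logits = sim / temperature
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    weights = np.exp(logits)
    weights = weights / weights.sum(axis=1, keepdims=True)
    return sim, weights


def video_similarity(a, b, window=3, temperature=0.1):
    """Aggregate alignment-weighted frame similarities into one score."""
    a_ctx = add_temporal_context(a, window)
    b_ctx = add_temporal_context(b, window)
    sim, weights = alignment_scores(a_ctx, b_ctx, temperature)
    # For each position in `a`, take the alignment-weighted similarity to `b`,
    # then average over all positions in `a`.
    return float((weights * sim).sum(axis=1).mean())
```

In a few-shot episode, a query video could be assigned the class of the support video with the highest `video_similarity`; in a trained model the feature extractor and the alignment function would be learned rather than fixed as they are here.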

