Learning a Condensed Frame for Memory-Efficient Video Class-Incremental Learning

11/02/2022
by   Yixuan Pei, et al.
0

Recent incremental learning for action recognition usually stores representative videos to mitigate catastrophic forgetting. However, only a few bulky videos can be stored due to the limited memory. To address this problem, we propose FrameMaker, a memory-efficient video class-incremental learning approach that learns to produce a condensed frame for each selected video. Specifically, FrameMaker is mainly composed of two crucial components: Frame Condensing and Instance-Specific Prompt. The former is to reduce the memory cost by preserving only one condensed frame instead of the whole video, while the latter aims to compensate the lost spatio-temporal details in the Frame Condensing stage. By this means, FrameMaker enables a remarkable reduction in memory but keep enough information that can be applied to following incremental tasks. Experimental results on multiple challenging benchmarks, i.e., HMDB51, UCF101 and Something-Something V2, demonstrate that FrameMaker can achieve better performance to recent advanced methods while consuming only 20 Additionally, under the same memory consumption conditions, FrameMaker significantly outperforms existing state-of-the-arts by a convincing margin.

READ FULL TEXT

page 9

page 19

research
07/21/2020

Uncertainty-Aware Weakly Supervised Action Detection from Untrimmed Videos

Despite the recent advances in video classification, progress in spatio-...
research
03/25/2022

Class-Incremental Learning for Action Recognition in Videos

We tackle catastrophic forgetting problem in the context of class-increm...
research
06/30/2021

When Video Classification Meets Incremental Classes

With the rapid development of social media, tremendous videos with new c...
research
07/22/2017

Spatio-temporal Human Action Localisation and Instance Segmentation in Temporally Untrimmed Videos

Current state-of-the-art human action recognition is focused on the clas...
research
07/24/2022

MAR: Masked Autoencoders for Efficient Action Recognition

Standard approaches for video recognition usually operate on the full in...
research
10/20/2022

YOWO-Plus: An Incremental Improvement

In this technical report, we would like to introduce our updates to YOWO...
research
02/03/2023

INV: Towards Streaming Incremental Neural Videos

Recent works in spatiotemporal radiance fields can produce photorealisti...

Please sign up or login with your details

Forgot password? Click here to reset