Class-Incremental Grouping Network for Continual Audio-Visual Learning

09/11/2023
by   Shentong Mo, et al.
0

Continual learning is a challenging problem in which models need to be trained on non-stationary data across sequential tasks for class-incremental learning. While previous methods have focused on using either regularization or rehearsal-based frameworks to alleviate catastrophic forgetting in image classification, they are limited to a single modality and cannot learn compact class-aware cross-modal representations for continual audio-visual learning. To address this gap, we propose a novel class-incremental grouping network (CIGN) that can learn category-wise semantic features to achieve continual audio-visual learning. Our CIGN leverages learnable audio-visual class tokens and audio-visual grouping to continually aggregate class-aware features. Additionally, it utilizes class tokens distillation and continual grouping to prevent forgetting parameters learned from previous tasks, thereby improving the model's ability to capture discriminative audio-visual categories. We conduct extensive experiments on VGGSound-Instruments, VGGSound-100, and VGG-Sound Sources benchmarks. Our experimental results demonstrate that the CIGN achieves state-of-the-art audio-visual class-incremental learning performance. Code is available at https://github.com/stoneMo/CIGN.

READ FULL TEXT
research
03/29/2023

Audio-Visual Grouping Network for Sound Localization from Mixtures

Sound source localization is a typical and challenging task that predict...
research
08/21/2023

Audio-Visual Class-Incremental Learning

In this paper, we introduce audio-visual class-incremental learning, a c...
research
05/11/2022

Contrastive Supervised Distillation for Continual Representation Learning

In this paper, we propose a novel training procedure for the continual r...
research
05/19/2023

AttriCLIP: A Non-Incremental Learner for Incremental Knowledge Learning

Continual learning aims to enable a model to incrementally learn knowled...
research
05/30/2023

Learning without Forgetting for Vision-Language Models

Class-Incremental Learning (CIL) or continual learning is a desired capa...
research
10/05/2022

Multi-stream Fusion for Class Incremental Learning in Pill Image Classification

Classifying pill categories from real-world images is crucial for variou...
research
11/29/2022

Isolation and Impartial Aggregation: A Paradigm of Incremental Learning without Interference

This paper focuses on the prevalent performance imbalance in the stages ...

Please sign up or login with your details

Forgot password? Click here to reset