Effects of Architectures on Continual Semantic Segmentation

02/21/2023
by Tobias Kalb, et al.

Research in Continual Semantic Segmentation mainly investigates novel learning algorithms to overcome catastrophic forgetting in neural networks. Most recent publications have focused on improving learning algorithms without isolating the effects caused by the choice of neural architecture. Therefore, we study how the choice of neural network architecture affects catastrophic forgetting in class- and domain-incremental semantic segmentation. Specifically, we compare well-researched CNNs to recently proposed Transformers and Hybrid architectures, and we examine the impact of novel normalization layers and different decoder heads. We find that traditional CNNs like ResNet have high plasticity but low stability, while transformer architectures are much more stable. Combining the inductive biases of CNN architectures with transformers in hybrid architectures leads to both higher plasticity and higher stability. The stability of these models can be explained by their ability to learn general features that are robust against distribution shifts. Experiments with different normalization layers show that Continual Normalization achieves the best trade-off between adaptability and stability of the model. In the class-incremental setting, the choice of normalization layer has much less impact. Our experiments suggest that the right choice of architecture can significantly reduce forgetting even with naive fine-tuning, and they confirm that for real-world applications the architecture is an important factor in designing a continual learning model.
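
The abstract does not state how stability and plasticity are quantified. As a minimal sketch, assuming the usual segmentation metric of mean intersection-over-union (mIoU) and assuming that forgetting is read as the drop in mIoU on an earlier task after training on a later one, the bookkeeping could look like the following. The function names `iou_per_class`, `mean_iou`, and `forgetting`, and the toy confusion matrices, are hypothetical and not taken from the paper.

```python
def iou_per_class(confusion):
    """Per-class IoU from a square confusion matrix.

    Rows are ground-truth classes, columns are predicted classes.
    Classes that appear in neither ground truth nor prediction yield None.
    """
    n = len(confusion)
    ious = []
    for c in range(n):
        tp = confusion[c][c]
        fn = sum(confusion[c]) - tp
        fp = sum(confusion[r][c] for r in range(n)) - tp
        denom = tp + fp + fn
        ious.append(tp / denom if denom > 0 else None)
    return ious


def mean_iou(confusion):
    """Mean IoU over all classes that occur in the confusion matrix."""
    valid = [v for v in iou_per_class(confusion) if v is not None]
    return sum(valid) / len(valid)


def forgetting(miou_before, miou_after):
    """Assumed stability proxy: mIoU lost on an old task after learning a new one."""
    return miou_before - miou_after


# Toy example (made-up numbers): task-1 confusion matrices measured
# right after training on task 1, and again after fine-tuning on task 2.
task1_after_task1 = [[50, 0], [0, 50]]
task1_after_task2 = [[40, 10], [10, 40]]

drop = forgetting(mean_iou(task1_after_task1), mean_iou(task1_after_task2))
print(f"mIoU drop on task 1: {drop:.3f}")
```

Under this reading, a smaller drop on the earlier task indicates higher stability, while the mIoU reached on the newly learned task serves as a rough indicator of plasticity.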

research
02/01/2022

Architecture Matters in Continual Learning

A large body of research in continual learning is devoted to overcoming ...
research
05/16/2023

CQural: A Novel CNN based Hybrid Architecture for Quantum Continual Machine Learning

Training machine learning models in an incremental fashion is not only i...
research
04/17/2022

Continual Hippocampus Segmentation with Transformers

In clinical settings, where acquisition conditions and patient populatio...
research
04/08/2023

Continual Learning for LiDAR Semantic Segmentation: Class-Incremental and Coarse-to-Fine strategies on Sparse Data

During the last few years, continual learning (CL) strategies for image ...
research
03/24/2023

Principles of Forgetting in Domain-Incremental Semantic Segmentation in Adverse Weather Conditions

Deep neural networks for scene perception in automated vehicles achieve ...
research
06/22/2021

SSUL: Semantic Segmentation with Unknown Label for Exemplar-based Class-Incremental Learning

We consider a class-incremental semantic segmentation (CISS) problem. Wh...
research
01/04/2022

Weakly-supervised continual learning for class-incremental segmentation

Transfer learning is a powerful way to adapt existing deep learning mode...
