Neural Architecture for Online Ensemble Continual Learning

11/27/2022
by   Mateusz Wójcik, et al.
0

Continual learning with an increasing number of classes is a challenging task. The difficulty rises when each example is presented exactly once, which requires the model to learn online. Recent methods with classic parameter optimization procedures have been shown to struggle in such setups or have limitations like non-differentiable components or memory buffers. For this reason, we present the fully differentiable ensemble method that allows us to efficiently train an ensemble of neural networks in the end-to-end regime. The proposed technique achieves SOTA results without a memory buffer and clearly outperforms the reference methods. The conducted experiments have also shown a significant increase in the performance for small ensembles, which demonstrates the capability of obtaining relatively high classification accuracy with a reduced number of classifiers.

READ FULL TEXT

page 8

page 9

research
07/11/2023

Domain-Agnostic Neural Architecture for Class Incremental Continual Learning in Document Processing Platform

Production deployments in complex systems require ML architectures to be...
research
05/06/2021

Structured Ensembles: an Approach to Reduce the Memory Footprint of Ensemble Methods

In this paper, we propose a novel ensembling technique for deep neural n...
research
05/27/2021

Encoders and Ensembles for Task-Free Continual Learning

We present an architecture that is effective for continual learning in a...
research
05/14/2019

Resource-aware Elastic Swap Random Forest for Evolving Data Streams

Continual learning based on data stream mining deals with ubiquitous sou...
research
04/20/2023

A baseline on continual learning methods for video action recognition

Continual learning has recently attracted attention from the research co...
research
11/24/2020

Generalized Variational Continual Learning

Continual learning deals with training models on new tasks and datasets ...
research
04/27/2020

Differentiable Adaptive Computation Time for Visual Reasoning

This paper presents a novel attention-based algorithm for achieving adap...

Please sign up or login with your details

Forgot password? Click here to reset