Lifelong Infinite Mixture Model Based on Knowledge-Driven Dirichlet Process

08/25/2021
by   Fei Ye, et al.
0

Recent research efforts in lifelong learning propose to grow a mixture of models to adapt to an increasing number of tasks. The proposed methodology shows promising results in overcoming catastrophic forgetting. However, the theory behind these successful models is still not well understood. In this paper, we perform the theoretical analysis for lifelong learning models by deriving the risk bounds based on the discrepancy distance between the probabilistic representation of data generated by the model and that corresponding to the target dataset. Inspired by the theoretical analysis, we introduce a new lifelong learning approach, namely the Lifelong Infinite Mixture (LIMix) model, which can automatically expand its network architectures or choose an appropriate component to adapt its parameters for learning a new task, while preserving its previously learnt information. We propose to incorporate the knowledge by means of Dirichlet processes by using a gating mechanism which computes the dependence between the knowledge learnt previously and stored in each component, and a new set of data. Besides, we train a compact Student model which can accumulate cross-domain representations over time and make quick inferences. The code is available at https://github.com/dtuzi123/Lifelong-infinite-mixture-model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/15/2021

Lifelong Generative Modelling Using Dynamic Expansion Graph Model

Variational Autoencoders (VAEs) suffer from degenerated performance, whe...
research
10/12/2022

Task-Free Continual Learning via Online Discrepancy Distance Learning

Learning from non-stationary data streams, also called Task-Free Continu...
research
07/11/2022

Learning an evolved mixture model for task-free continual learning

Recently, continual learning (CL) has gained significant interest becaus...
research
10/12/2019

An Imputation model by Dirichlet Process Mixture of Elliptical Copulas for Data of Mixed Type

Copula-based methods provide a flexible approach to build missing data i...
research
05/22/2022

A Dirichlet Process Mixture of Robust Task Models for Scalable Lifelong Reinforcement Learning

While reinforcement learning (RL) algorithms are achieving state-of-the-...
research
02/22/2023

Learning Mixture Structure on Multi-Source Time Series for Probabilistic Forecasting

In many data-driven applications, collecting data from different sources...

Please sign up or login with your details

Forgot password? Click here to reset