Nonstationary data stream classification with online active learning and siamese neural networks

10/03/2022
by   Kleanthis Malialis, et al.
0

We have witnessed in recent years an ever-growing volume of information becoming available in a streaming manner in various application areas. As a result, there is an emerging need for online learning methods that train predictive models on-the-fly. A series of open challenges, however, hinder their deployment in practice. These are, learning as data arrive in real-time one-by-one, learning from data with limited ground truth information, learning from nonstationary data, and learning from severely imbalanced data, while occupying a limited amount of memory for data storage. We propose the ActiSiamese algorithm, which addresses these challenges by combining online active learning, siamese networks, and a multi-queue memory. It develops a new density-based active learning strategy which considers similarity in the latent (rather than the input) space. We conduct an extensive study that compares the role of different active learning budgets and strategies, the performance with/without memory, the performance with/without ensembling, in both synthetic and real-world datasets, under different data nonstationarity characteristics and class imbalance levels. ActiSiamese outperforms baseline and state-of-the-art algorithms, and is effective under severe imbalance, even only when a fraction of the arriving instances' labels is available. We publicly release our code to the community.

READ FULL TEXT
research
10/04/2020

Data-efficient Online Classification with Siamese Networks and Active Learning

An ever increasing volume of data is nowadays becoming available in a st...
research
10/13/2022

Data augmentation on-the-fly and active learning in data stream classification

There is an emerging need for predictive models to be trained on-the-fly...
research
09/24/2020

Online Learning With Adaptive Rebalancing in Nonstationary Environments

An enormous and ever-growing volume of data is nowadays becoming availab...
research
04/11/2016

Active Learning for Online Recognition of Human Activities from Streaming Videos

Recognising human activities from streaming videos poses unique challeng...
research
11/18/2019

Online Adaptive Asymmetric Active Learning with Limited Budgets

Online Active Learning (OAL) aims to manage unlabeled datastream by sele...
research
07/16/2021

Active learning for online training in imbalanced data streams under cold start

Labeled data is essential in modern systems that rely on Machine Learnin...
research
09/27/2018

Queue-based Resampling for Online Class Imbalance Learning

Online class imbalance learning constitutes a new problem and an emergin...

Please sign up or login with your details

Forgot password? Click here to reset