Cold Start Active Learning Strategies in the Context of Imbalanced Classification

01/25/2022
by   Etienne Brangbour, et al.
0

We present novel active learning strategies dedicated to providing a solution to the cold start stage, i.e. initializing the classification of a large set of data with no attached labels. Moreover, proposed strategies are designed to handle an imbalanced context in which random selection is highly inefficient. Specifically, our active learning iterations address label scarcity and imbalance using element scores, combining information extracted from a clustering structure to a label propagation model. The strategy is illustrated by a case study on annotating Twitter content w.r.t. testimonies of a real flood event. We show that our method effectively copes with class imbalance, by boosting the recall of samples from the minority class.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/09/2021

Class-Balanced Active Learning for Image Classification

Active learning aims to reduce the labeling effort that is required to t...
research
07/16/2021

Active learning for online training in imbalanced data streams under cold start

Labeled data is essential in modern systems that rely on Machine Learnin...
research
11/18/2019

Online Adaptive Asymmetric Active Learning with Limited Budgets

Online Active Learning (OAL) aims to manage unlabeled datastream by sele...
research
01/24/2018

Support Vector Machine Active Learning Algorithms with Query-by-Committee versus Closest-to-Hyperplane Selection

This paper investigates and evaluates support vector machine active lear...
research
05/03/2023

Transfer and Active Learning for Dissonance Detection: Addressing the Rare-Class Challenge

While transformer-based systems have enabled greater accuracies with few...
research
10/10/2020

On the Importance of Adaptive Data Collection for Extremely Imbalanced Pairwise Tasks

Many pairwise classification tasks, such as paraphrase detection and ope...
research
10/04/2020

Data-efficient Online Classification with Siamese Networks and Active Learning

An ever increasing volume of data is nowadays becoming available in a st...

Please sign up or login with your details

Forgot password? Click here to reset