TiDAL: Learning Training Dynamics for Active Learning

10/13/2022
by Seong Min Kye, et al.

Active learning (AL) aims to select the most useful data samples from an unlabeled data pool and annotate them to expand the labeled dataset under a limited budget. In particular, uncertainty-based methods, which select the most uncertain samples, are known to be effective in improving model performance. However, the AL literature often overlooks training dynamics (TD), defined as the ever-changing model behavior during optimization via stochastic gradient descent, even though other areas of the literature have empirically shown that TD provides important clues for measuring sample uncertainty. In this paper, we propose a novel AL method, Training Dynamics for Active Learning (TiDAL), which leverages TD to quantify the uncertainty of unlabeled data. Since tracking the TD of all large-scale unlabeled data is impractical, TiDAL uses an additional prediction module that learns the TD of labeled data. To further justify the design of TiDAL, we provide theoretical and empirical evidence for the usefulness of leveraging TD for AL. Experimental results show that TiDAL achieves better or comparable performance on both balanced and imbalanced benchmark datasets compared to state-of-the-art AL methods, which estimate data uncertainty using only static information after model training.
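To make the contrast between static and dynamic uncertainty concrete, here is a minimal sketch in plain Python. It is not the authors' implementation. It assumes, for illustration only, that TD is summarized as the per-class softmax output averaged over epochs and that uncertainty is the entropy of that average. The function names (`average_dynamics`, `entropy`, `select_most_uncertain`) and the toy probabilities are hypothetical. In TiDAL the TD of unlabeled samples would come from the additional prediction module rather than from recorded histories as it does here.

```python
import math


def average_dynamics(prob_history):
    """Average the per-epoch class probabilities of one sample.

    prob_history: list of probability vectors, one per epoch.
    Assumption: this average stands in for the sample's training dynamics.
    """
    num_epochs = len(prob_history)
    num_classes = len(prob_history[0])
    return [
        sum(epoch[c] for epoch in prob_history) / num_epochs
        for c in range(num_classes)
    ]


def entropy(probs):
    """Shannon entropy (natural log) of a probability vector."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def select_most_uncertain(td_by_sample, budget):
    """Return the ids of the `budget` samples with the highest TD entropy."""
    ranked = sorted(
        td_by_sample,
        key=lambda sample_id: entropy(td_by_sample[sample_id]),
        reverse=True,
    )
    return ranked[:budget]


# Toy histories: softmax outputs over three epochs for two samples.
histories = {
    "stable": [[0.90, 0.10], [0.95, 0.05], [0.97, 0.03]],
    "flipping": [[0.20, 0.80], [0.80, 0.20], [0.98, 0.02]],
}

# Static view: entropy of the final-epoch prediction only.
static_entropy = {sid: entropy(h[-1]) for sid, h in histories.items()}

# Dynamic view: entropy of the epoch-averaged prediction.
td = {sid: average_dynamics(h) for sid, h in histories.items()}
picked = select_most_uncertain(td, budget=1)
```

In this toy case the final-epoch prediction makes the "flipping" sample look more confident than the "stable" one, while the averaged history ranks it as the more uncertain of the two, so it is the one selected for annotation.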


research
10/16/2019

Consistency-Based Semi-Supervised Active Learning: Towards Minimizing Labeling Cost

Active learning (AL) integrates data labeling and model training to mini...
research
07/30/2021

When Deep Learners Change Their Mind: Learning Dynamics for Active Learning

Active learning aims to select samples to be annotated that yield the la...
research
08/13/2021

Jasmine: A New Active Learning Approach to Combat Cybercrime

Over the past decade, the advent of cybercrime has accelerated the resea...
research
04/08/2021

Relieving the Plateau: Active Semi-Supervised Learning for a Better Landscape

Deep learning (DL) relies on massive amounts of labeled data, and improv...
research
11/25/2021

Active Learning at the ImageNet Scale

Active learning (AL) algorithms aim to identify an optimal subset of dat...
research
07/22/2020

DEAL: Deep Evidential Active Learning for Image Classification

Convolutional Neural Networks (CNNs) have proven to be state-of-the-art ...
research
04/14/2019

Exploring Representativeness and Informativeness for Active Learning

How can we find a general way to choose the most suitable samples for tr...
