On Initial Pools for Deep Active Learning

11/30/2020
by Akshay L Chandra, et al.

Active Learning (AL) techniques aim to minimize the training data required to train a model for a given task. Pool-based AL techniques start with a small initial labeled pool and then iteratively pick batches of the most informative samples for labeling. Generally, the initial pool is sampled randomly and labeled to seed the AL iterations. While recent studies have focused on evaluating the robustness of various query functions in AL, little to no attention has been given to the design of the initial labeled pool. Given the recent successes of learning representations in self-supervised and unsupervised ways, we propose to study whether an intelligently sampled initial labeled pool can improve deep AL performance. We will investigate the effect of intelligently sampled initial labeled pools, including pools built with self-supervised and unsupervised strategies, on deep AL methods. In this proposal we describe our experimental and implementation details, datasets, and performance metrics, as well as planned ablation studies. If intelligently sampled initial pools improve AL performance, our work could contribute by boosting AL performance with no additional annotation, by lowering the annotation cost of developing datasets in general, and by promoting further research into the use of unsupervised learning methods for AL.
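The pool-based loop described in the abstract can be sketched in a few lines. This is only an illustration under stated assumptions, not the authors' method: the data is synthetic, the model is a stand-in nearest-class-mean classifier rather than a deep network, the query function is predictive entropy, and `kmeans_initial_pool` is a hypothetical example of an "intelligently sampled" initial pool (label the point nearest each k-means centroid). All function names are made up for this sketch.

```python
import numpy as np

def random_initial_pool(X, size, rng):
    """Baseline: sample the initial labeled pool uniformly at random."""
    return rng.choice(len(X), size=size, replace=False)

def kmeans_initial_pool(X, size, rng, iters=20):
    """Hypothetical unsupervised pool: run k-means with k = size and pick
    the point nearest each centroid. May return fewer than `size` indices
    if two centroids share the same nearest point."""
    centroids = X[rng.choice(len(X), size=size, replace=False)].copy()
    for _ in range(iters):
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        assign = dists.argmin(axis=1)
        for k in range(size):
            members = X[assign == k]
            if len(members) > 0:
                centroids[k] = members.mean(axis=0)
    dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    return np.unique(dists.argmin(axis=0))

def class_probs(X_lab, y_lab, X, n_classes):
    """Stand-in model: softmax over negative distances to class means.
    Classes with no labeled example yet get probability zero."""
    scores = np.full((len(X), n_classes), -np.inf)
    for c in range(n_classes):
        if np.any(y_lab == c):
            mean = X_lab[y_lab == c].mean(axis=0)
            scores[:, c] = -np.linalg.norm(X - mean, axis=1)
    scores -= scores.max(axis=1, keepdims=True)
    probs = np.exp(scores)
    return probs / probs.sum(axis=1, keepdims=True)

def active_learning(X, y, initial_idx, batch_size, rounds, n_classes):
    """Pool-based AL: refit, score the pool, query the most uncertain batch."""
    labeled = np.array(initial_idx)
    history = []
    for _ in range(rounds):
        probs = class_probs(X[labeled], y[labeled], X, n_classes)
        # Accuracy over the full pool, used here only for illustration.
        history.append(float((probs.argmax(axis=1) == y).mean()))
        entropy = -(probs * np.log(probs + 1e-12)).sum(axis=1)
        entropy[labeled] = -np.inf  # never re-query already labeled points
        query = np.argsort(entropy)[-batch_size:]
        labeled = np.concatenate([labeled, query])
    return labeled, history

# Synthetic three-class data (an assumption made for this sketch).
rng = np.random.default_rng(0)
centers = np.array([[0, 0], [4, 0], [0, 4]])
y = rng.integers(0, 3, size=300)
X = centers[y] + rng.normal(size=(300, 2))

for name, sampler in [("random", random_initial_pool),
                      ("k-means", kmeans_initial_pool)]:
    init = sampler(X, 6, np.random.default_rng(1))
    labeled, history = active_learning(X, y, init, batch_size=6,
                                       rounds=5, n_classes=3)
    print(name, [round(a, 3) for a in history])
```

Only the sampler passed in changes between the two runs, so the query function and annotation budget stay fixed while the initial pool varies, which mirrors the comparison the proposal sets out to make. A toy run like this says nothing about whether the effect holds for deep models.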

