Bias-Aware Heapified Policy for Active Learning

11/18/2019
by   Wen-Yen Chang, et al.

The data efficiency of learning-based algorithms is increasingly important because high-quality, clean data is expensive and hard to collect. Active learning aims to achieve high model performance with the fewest samples by querying the most important subset of data from the original dataset. One mainstream line of research in active learning is the heuristic uncertainty-based method, which is useful for learning-based systems. Recently, a few works have proposed applying policy reinforcement learning (PRL) to query important data. This approach appears more general than heuristic uncertainty-based methods because PRL relies on data features, which are more reliable than human priors. However, applying PRL to active learning raises two problems: sample inefficiency of policy learning, and overconfidence. More precisely, sample inefficiency of policy learning occurs when sampling within a large action space, while class imbalance can lead to overconfidence. In this paper, we propose a bias-aware policy network called Heapified Active Learning (HAL), which prevents overconfidence and improves the sample efficiency of policy learning through a heapified structure, without ignoring global information (an overview of the whole unlabeled set). In our experiments, HAL outperforms the baseline methods on the MNIST dataset and on duplicated MNIST. Finally, we investigate the generalization of the HAL policy learned on MNIST by applying it directly to MNIST-M. We show that the agent can generalize and outperform a directly learned policy under constrained labeled sets.

research
08/08/2017

Learning how to Active Learn: A Deep Reinforcement Learning Approach

Active learning aims to select a small subset of data for annotation suc...
research
08/29/2018

Learning a Policy for Opportunistic Active Learning

Active learning identifies data points to label that are expected to be ...
research
01/08/2019

Risk-Aware Active Inverse Reinforcement Learning

Active learning from demonstration allows a robot to query a human for s...
research
08/07/2020

Deep Active Learning with Crowdsourcing Data for Privacy Policy Classification

Privacy policies are statements that notify users of the services' data ...
research
06/22/2020

Effective Version Space Reduction for Convolutional Neural Networks

In active learning, sampling bias could pose a serious inconsistency pro...
research
07/09/2020

IALE: Imitating Active Learner Ensembles

Active learning (AL) prioritizes the labeling of the most informative da...
research
08/16/2022

Generating a Terrain-Robustness Benchmark for Legged Locomotion: A Prototype via Terrain Authoring and Active Learning

Terrain-aware locomotion has become an emerging topic in legged robotics...
