Forgetful Active Learning with Switch Events: Efficient Sampling for Out-of-Distribution Data

01/12/2023
by   Ryan Benkert, et al.
0

This paper considers deep out-of-distribution active learning. In practice, fully trained neural networks interact randomly with out-of-distribution (OOD) inputs and map aberrant samples randomly within the model representation space. Since data representations are direct manifestations of the training distribution, the data selection process plays a crucial role in outlier robustness. For paradigms such as active learning, this is especially challenging since protocols must not only improve performance on the training distribution most effectively but further render a robust representation space. However, existing strategies directly base the data selection on the data representation of the unlabeled data which is random for OOD samples by definition. For this purpose, we introduce forgetful active learning with switch events (FALSE) - a novel active learning protocol for out-of-distribution active learning. Instead of defining sample importance on the data representation directly, we formulate "informativeness" with learning difficulty during training. Specifically, we approximate how often the network "forgets" unlabeled samples and query the most "forgotten" samples for annotation. We report up to 4.5% accuracy improvements in over 270 experiments, including four commonly used protocols, two OOD benchmarks, one in-distribution benchmark, and three different architectures.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/25/2023

Deep Active Learning with Contrastive Learning Under Realistic Data Pool Assumptions

Active learning aims to identify the most informative data from an unlab...
research
07/27/2022

ALBench: A Framework for Evaluating Active Learning in Object Detection

Active learning is an important technology for automated machine learnin...
research
06/23/2022

Patient Aware Active Learning for Fine-Grained OCT Classification

This paper considers making active learning more sensible from a medical...
research
08/16/2022

Active Bucketized Learning for ACOPF Optimization Proxies

This paper considers optimization proxies for Optimal Power Flow (OPF), ...
research
06/13/2022

On the reusability of samples in active learning

An interesting but not extensively studied question in active learning i...
research
06/30/2022

Data-Efficient Learning via Minimizing Hyperspherical Energy

Deep learning on large-scale data is dominant nowadays. The unprecedente...
research
10/13/2022

Meta-Query-Net: Resolving Purity-Informativeness Dilemma in Open-set Active Learning

Unlabeled data examples awaiting annotations contain open-set noise inev...

Please sign up or login with your details

Forgot password? Click here to reset