Boosting Active Learning via Improving Test Performance

12/10/2021
by Tianyang Wang, et al.

Central to active learning (AL) is the question of which data should be selected for annotation. Existing works attempt to select highly uncertain or informative data. Nevertheless, it remains unclear how the selected data affects the test performance of the task model used in AL. In this work, we explore this effect by theoretically proving that selecting unlabeled data with a higher gradient norm leads to a lower upper bound on the test loss, and thus to better test performance. However, because label information is unavailable, directly computing the gradient norm for unlabeled data is infeasible. To address this challenge, we propose two schemes, namely expected-gradnorm and entropy-gradnorm. The former computes the gradient norm by constructing an expected empirical loss, while the latter constructs an unsupervised loss based on entropy. Furthermore, we integrate the two schemes into a universal AL framework. We evaluate our method on classical image classification and semantic segmentation tasks. To demonstrate its competency in domain applications and its robustness to noise, we also validate our method on a cellular imaging analysis task, namely cryo-Electron Tomography subtomogram classification. Results demonstrate that our method outperforms the state of the art. Our source code is available at https://github.com/xulabs/aitom
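To make the entropy-based scheme concrete, here is a minimal sketch, not the authors' implementation. It assumes a linear softmax classifier (logits = W x), for which the gradient of the prediction entropy with respect to W has a closed form, and it ranks a pool of unlabeled samples by the norm of that gradient. The function names `entropy_gradnorm` and `select_for_annotation`, the linear model, and the top-k selection rule are all illustrative assumptions; the expected-gradnorm scheme is not sketched here.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy_gradnorm(weights, x):
    """Frobenius norm of dH/dW for an assumed linear softmax model.

    H is the entropy of the predicted class distribution, so no label
    is needed. `weights` is a list of rows (one per class), `x` a
    feature vector.
    """
    logits = [sum(w * xi for w, xi in zip(row, x)) for row in weights]
    probs = softmax(logits)
    entropy = -sum(p * math.log(p) for p in probs if p > 0.0)
    # Gradient of the entropy w.r.t. each logit: -p_j * (log p_j + H).
    grad_logits = [
        -p * (math.log(p) + entropy) if p > 0.0 else 0.0 for p in probs
    ]
    # dH/dW is the outer product of grad_logits and x, so its
    # Frobenius norm is the product of the two vector norms.
    g_norm = math.sqrt(sum(g * g for g in grad_logits))
    x_norm = math.sqrt(sum(xi * xi for xi in x))
    return g_norm * x_norm


def select_for_annotation(weights, pool, budget):
    """Return indices of the `budget` pool samples with the largest score."""
    scores = [entropy_gradnorm(weights, x) for x in pool]
    ranked = sorted(range(len(pool)), key=lambda i: scores[i], reverse=True)
    return ranked[:budget]


if __name__ == "__main__":
    W = [[1.0, 0.0], [0.0, 1.0]]        # hypothetical 2-class model
    pool = [[1.0, 1.0], [1.0, 0.0]]     # hypothetical unlabeled samples
    print(select_for_annotation(W, pool, budget=1))
```

Note that under this sketch the entropy gradient vanishes both for a perfectly uniform prediction and for a fully confident one, so ranking by gradient norm is not the same as ranking by entropy itself.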


