Consistency-Based Semi-Supervised Active Learning: Towards Minimizing Labeling Cost

10/16/2019
by   Mingfei Gao, et al.
28

Active learning (AL) integrates data labeling and model training to minimize the labeling cost by prioritizing the selection of high value data that can best improve model performance. Readily-available unlabeled data are used for selection mechanisms, but are not used for model training in most conventional pool-based AL methods. To minimize the labeling cost, we unify unlabeled sample selection and model training based on two principles. First, we exploit both labeled and unlabeled data using semi-supervised learning (SSL) to distill information from unlabeled data that improves representation learning and sample selection. Second, we propose a simple yet effective selection metric that is coherent with the training objective such that the selected samples are effective at improving model performance. Experimental results demonstrate superior performance of our proposed principles for limited labeled data compared to alternative AL and SSL combinations. In addition, we study an important problem – "When can we start AL?". We propose a measure that is empirically correlated with the AL target loss and can be used to assist in determining the proper start point.

READ FULL TEXT
research
10/19/2020

Semi-supervised Batch Active Learning via Bilevel Optimization

Active learning is an effective technique for reducing the labeling cost...
research
10/13/2022

TiDAL: Learning Training Dynamics for Active Learning

Active learning (AL) aims to select the most useful data samples from an...
research
04/08/2021

Relieving the Plateau: Active Semi-Supervised Learning for a Better Landscape

Deep learning (DL) relies on massive amounts of labeled data, and improv...
research
03/15/2023

Active Semi-Supervised Learning by Exploring Per-Sample Uncertainty and Consistency

Active Learning (AL) and Semi-supervised Learning are two techniques tha...
research
09/13/2022

Warm Start Active Learning with Proxy Labels & Selection via Semi-Supervised Fine-Tuning

Which volume to annotate next is a challenging problem in building medic...
research
10/07/2021

Food Science Spectroscopy Model Training: Improving Data Efficiency Using Active Learning and Semi-Supervised Learning

The past decade witnesses a rapid development in the measurement and mon...
research
03/24/2023

Optimizing the Procedure of CT Segmentation Labeling

In Computed Tomography, machine learning is often used for automated dat...

Please sign up or login with your details

Forgot password? Click here to reset