Active Learning with Combinatorial Coverage

02/28/2023
by   Sai Prathyush Katragadda, et al.
0

Active learning is a practical field of machine learning that automates the process of selecting which data to label. Current methods are effective in reducing the burden of data labeling but are heavily model-reliant. This has led to the inability of sampled data to be transferred to new models as well as issues with sampling bias. Both issues are of crucial concern in machine learning deployment. We propose active learning methods utilizing combinatorial coverage to overcome these issues. The proposed methods are data-centric, as opposed to model-centric, and through our experiments we show that the inclusion of coverage in active learning leads to sampling data that tends to be the best in transferring to better performing models and has a competitive sampling bias compared to benchmark methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/13/2021

Mitigating Sampling Bias and Improving Robustness in Active Learning

This paper presents simple and efficient methods to mitigate sampling bi...
research
08/02/2016

Can Active Learning Experience Be Transferred?

Active learning is an important machine learning problem in reducing the...
research
09/22/2020

Model-Centric and Data-Centric Aspects of Active Learning for Neural Network Models

We study different data-centric and model-centric aspects of active lear...
research
12/17/2021

An overview of active learning methods for insurance with fairness appreciation

This paper addresses and solves some challenges in the adoption of machi...
research
12/13/2021

Addressing Bias in Active Learning with Depth Uncertainty Networks... or Not

Farquhar et al. [2021] show that correcting for active learning bias wit...
research
08/21/2023

Overcoming Overconfidence for Active Learning

It is not an exaggeration to say that the recent progress in artificial ...
research
12/20/2022

Active sampling: A machine-learning-assisted framework for finite population inference with optimal subsamples

Data subsampling has become widely recognized as a tool to overcome comp...

Please sign up or login with your details

Forgot password? Click here to reset