Pool-Based Sequential Active Learning for Regression

05/12/2018
by   Dongrui Wu, et al.
0

Active learning is a machine learning approach for reducing the data labeling effort. Given a pool of unlabeled samples, it tries to select the most useful ones to label so that a model built from them can achieve the best possible performance. This paper focuses on pool-based sequential active learning for regression (ALR). We first propose three essential criteria that an ALR approach should consider in selecting the most useful unlabeled samples: informativeness, representativeness, and diversity, and compare four existing ALR approaches against them. We then propose a new ALR approach using passive sampling, which considers both the representativeness and the diversity in both the initialization and subsequent iterations. Remarkably, this approach can also be integrated with other existing ALR approaches in the literature to further improve the performance. Extensive experiments on 11 UCI, CMU StatLib, and UFL Media Core datasets from various domains verified the effectiveness of our proposed ALR approaches.

READ FULL TEXT

page 9

page 10

research
01/14/2020

Unsupervised Pool-Based Active Learning for Linear Regression

In many real-world machine learning applications, unlabeled data can be ...
research
01/30/2020

A Graph-Based Approach for Active Learning in Regression

Active learning aims to reduce labeling efforts by selectively asking hu...
research
10/22/2020

Pool-based sequential active learning with multi kernels

We study a pool-based sequential active learning (AL), in which one samp...
research
01/31/2023

Deep Active Learning for Scientific Computing in the Wild

Deep learning (DL) is revolutionizing the scientific computing community...
research
06/10/2022

In Defense of Core-set: A Density-aware Core-set Selection for Active Learning

Active learning enables the efficient construction of a labeled dataset ...
research
07/31/2023

DiffusAL: Coupling Active Learning with Graph Diffusion for Label-Efficient Node Classification

Node classification is one of the core tasks on attributed graphs, but s...
research
04/02/2019

Sequential Adaptive Design for Jump Regression Estimation

Selecting input data or design points for statistical models has been of...

Please sign up or login with your details

Forgot password? Click here to reset