Active Learning for Regression Using Greedy Sampling

08/08/2018
by   Dongrui Wu, et al.
0

Regression problems are pervasive in real-world applications. Generally a substantial amount of labeled samples are needed to build a regression model with good generalization ability. However, many times it is relatively easy to collect a large number of unlabeled samples, but time-consuming or expensive to label them. Active learning for regression (ALR) is a methodology to reduce the number of labeled samples, by selecting the most beneficial ones to label, instead of random selection. This paper proposes two new ALR approaches based on greedy sampling (GS). The first approach (GSy) selects new samples to increase the diversity in the output space, and the second (iGS) selects new samples to increase the diversity in both input and output spaces. Extensive experiments on 12 UCI and CMU StatLib datasets from various domains, and on 15 subjects on EEG-based driver drowsiness estimation, verified their effectiveness and robustness.

READ FULL TEXT

page 11

page 16

research
05/12/2018

Offline EEG-Based Driver Drowsiness Estimation Using Enhanced Batch-Mode Active Learning (EBMAL) for Regression

There are many important regression problems in real-world brain-compute...
research
03/26/2019

Active Stacking for Heart Rate Estimation

Heart rate estimation from electrocardiogram signals is very important f...
research
07/17/2019

Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples

For many important problems the quantity of interest (or output) is an u...
research
01/14/2020

Unsupervised Pool-Based Active Learning for Linear Regression

In many real-world machine learning applications, unlabeled data can be ...
research
08/08/2018

Affect Estimation in 3D Space Using Multi-Task Active Learning for Regression

Acquisition of labeled training samples for affective computing is usual...
research
08/25/2017

Active Expansion Sampling for Learning Feasible Domains in an Unbounded Input Space

Many engineering problems require identifying feasible domains under imp...
research
07/31/2023

DiffusAL: Coupling Active Learning with Graph Diffusion for Label-Efficient Node Classification

Node classification is one of the core tasks on attributed graphs, but s...

Please sign up or login with your details

Forgot password? Click here to reset