Batch Active Learning Using Determinantal Point Processes

06/19/2019
by Erdem Bıyık, et al.

Collecting and labeling data is one of the main challenges in applying machine learning to real-world applications where data is limited. Active learning methods address this by labeling only the most informative samples, but they generally incur large computational costs and are impractical in settings where data can be collected in parallel. Batch active learning methods reduce this burden by querying a batch of samples at a time. To avoid redundancy within a batch, previous works rely on ad hoc combinations of sample quality and diversity. In this paper, we present a new principled batch active learning method using Determinantal Point Processes (DPPs), repulsive point processes that enable generating diverse batches of samples. We develop tractable algorithms to approximate the mode of a DPP distribution and provide theoretical guarantees on the degree of approximation. We further demonstrate that an iterative greedy method for DPP maximization, which has lower computational cost but weaker theoretical guarantees, still gives competitive results for batch active learning. Our experiments show the value of our methods on several datasets against state-of-the-art baselines.
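
The greedy approach mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a DPP kernel of the form L = diag(q) S diag(q), where q is a hypothetical per-sample quality score (for example, model uncertainty) and S is an RBF similarity matrix, and it greedily adds the sample that most increases the log-determinant of the selected submatrix. The function names, the kernel choice, and the `lengthscale` parameter are illustrative assumptions.

```python
import numpy as np


def rbf_kernel(X, lengthscale=1.0):
    """Pairwise RBF similarity matrix (an assumed similarity measure)."""
    sq_dists = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    return np.exp(-sq_dists / (2.0 * lengthscale ** 2))


def greedy_dpp_batch(X, quality, k, lengthscale=1.0):
    """Greedily pick up to k indices that approximately maximize log det(L_S).

    L = diag(quality) @ S @ diag(quality) trades off sample quality
    (diagonal scaling) against diversity (off-diagonal similarity).
    """
    X = np.asarray(X, dtype=float)
    quality = np.asarray(quality, dtype=float)
    S = rbf_kernel(X, lengthscale)
    L = quality[:, None] * S * quality[None, :]

    selected = []
    for _ in range(k):
        best_i, best_logdet = None, -np.inf
        for i in range(len(X)):
            if i in selected:
                continue
            idx = selected + [i]
            # slogdet is numerically safer than log(det(...)).
            sign, logdet = np.linalg.slogdet(L[np.ix_(idx, idx)])
            if sign > 0 and logdet > best_logdet:
                best_i, best_logdet = i, logdet
        if best_i is None:
            # No candidate keeps the submatrix positive definite.
            break
        selected.append(best_i)
    return selected
```

With two near-duplicate points and one distant point, the sketch picks one of the duplicates and the distant point, even when the distant point has slightly lower quality. This naive version recomputes a determinant for every candidate; practical implementations typically use incremental updates instead.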


