Generalized Chernoff Sampling for Active Learning and Structured Bandit Algorithms

12/15/2020
by Subhojyoti Mukherjee, et al.

Active learning and structured stochastic bandit problems are intimately related to the classical problem of sequential experimental design. This paper studies active learning and best-arm identification in structured bandit settings from the viewpoint of active sequential hypothesis testing, a framework initiated by Chernoff (1959). We first characterize the sample complexity of Chernoff's original procedure by uncovering terms that diminish in significance as the allowed error probability δ → 0 but are nevertheless relevant at any fixed value of δ > 0. Although Chernoff sampling was initially proposed for testing among finitely many hypotheses, we obtain its analogue for the case where the hypotheses belong to a compact space. This makes it applicable to active learning and structured bandit problems, where the unknown parameter specifying the arm means is often assumed to be an element of Euclidean space. Empirically, we demonstrate the potential of our proposed approach for active learning of neural network models and in the linear bandit setting, where we observe that our general-purpose approach compares favorably to state-of-the-art methods.
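
To make the finite-hypothesis setting concrete, the following is a minimal sketch of a Chernoff-style sampling loop, not the paper's implementation. It assumes unit-variance Gaussian observations, approximates the max-min sampling proportions by a coarse grid search over the simplex, and uses a stopping threshold of log(number of hypotheses / δ). All three choices, along with the function names and the toy `means` table, are illustrative assumptions. The compact-space generalization described in the abstract is not shown.

```python
import math
import random
from itertools import product


def kl_gauss(m1, m2):
    """KL divergence between unit-variance Gaussians with means m1 and m2."""
    return 0.5 * (m1 - m2) ** 2


def simplex_grid(k, steps):
    """Yield length-k probability vectors whose entries are multiples of 1/steps."""
    for combo in product(range(steps + 1), repeat=k - 1):
        if sum(combo) <= steps:
            yield [c / steps for c in combo] + [(steps - sum(combo)) / steps]


def chernoff_proportions(means, best, steps=20):
    """Grid-search approximation (an assumption of this sketch) of
    argmax_p min_{j != best} sum_i p_i * KL(means[best][i], means[j][i])."""
    def worst_case(p):
        return min(
            sum(pi * kl_gauss(a, b) for pi, a, b in zip(p, means[best], means[j]))
            for j in range(len(means)) if j != best
        )
    return max(simplex_grid(len(means[best]), steps), key=worst_case)


def chernoff_sampling(means, true_idx, delta=0.05, seed=0, max_rounds=10_000):
    """Run the loop until the log-likelihood gap exceeds an assumed threshold.

    means[j][i] is the mean response of action i under hypothesis j.
    Returns (index of the selected hypothesis, number of samples used).
    """
    rng = random.Random(seed)
    n_hyp, n_act = len(means), len(means[0])
    loglik = [0.0] * n_hyp
    best = 0
    for t in range(1, max_rounds + 1):
        # Current most likely hypothesis and its max-min sampling proportions.
        best = max(range(n_hyp), key=loglik.__getitem__)
        p = chernoff_proportions(means, best)
        action = rng.choices(range(n_act), weights=p)[0]
        # Simulated observation from the (hidden) true hypothesis.
        y = rng.gauss(means[true_idx][action], 1.0)
        # Gaussian log-likelihood update, dropping the shared constant.
        for j in range(n_hyp):
            loglik[j] -= 0.5 * (y - means[j][action]) ** 2
        best = max(range(n_hyp), key=loglik.__getitem__)
        gap = min(loglik[best] - loglik[j] for j in range(n_hyp) if j != best)
        if gap > math.log(n_hyp / delta):  # assumed stopping threshold
            return best, t
    return best, max_rounds


# Hypothetical toy instance: 3 hypotheses, 2 actions.
means = [[0.0, 1.0], [0.0, -1.0], [0.5, 1.0]]
selected, rounds = chernoff_sampling(means, true_idx=0)
print(selected, rounds)
```

In this toy instance, action 1 separates hypothesis 0 from hypothesis 1 strongly, while only action 0 separates hypothesis 0 from hypothesis 2, and weakly. The max-min proportions therefore put most of the sampling weight on action 0 when hypothesis 0 is the current best guess.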

research
01/30/2020

A Graph-Based Approach for Active Learning in Regression

Active learning aims to reduce labeling efforts by selectively asking hu...
research
02/14/2020

On State Variables, Bandit Problems and POMDPs

State variables are easily the most subtle dimension of sequential decis...
research
07/24/2018

A Structured Perspective of Volumes on Active Learning

Active Learning (AL) is a learning task that requires learners interacti...
research
05/20/2019

Gradient Ascent for Active Exploration in Bandit Problems

We present a new algorithm based on a gradient ascent for a general Act...
research
05/13/2021

Improved Algorithms for Agnostic Pool-based Active Classification

We consider active learning for binary classification in the agnostic po...
research
05/09/2019

Non-Asymptotic Sequential Tests for Overlapping Hypotheses and application to near optimal arm identification in bandit models

In this paper, we study sequential testing problems with overlapping hyp...
research
06/22/2022

Active Learning with Safety Constraints

Active learning methods have shown great promise in reducing the number ...
