We investigate the fixed-budget best-arm identification (BAI) problem fo...
This paper considers a variant of zero-sum matrix games where at each
ti...
Labeled data are critical to modern machine learning applications, but
o...
Learning to control unknown nonlinear dynamical systems is a fundamental...
Representation learning based on multi-task pretraining has become a pow...
To leverage the copious amount of data from source tasks and overcome th...
Automating warehouse operations can reduce logistics overhead costs,
ult...
We study the sample complexity of identifying an approximate equilibrium...
While much progress has been made in understanding the minimax sample
co...
In the stochastic contextual bandit setting, regret-minimizing algorithm...
Active learning methods have shown great promise in reducing the number ...
To leverage the power of big data from source tasks and overcome the sca...
Reward-free reinforcement learning (RL) considers the setting where the ...
Obtaining first-order regret bounds – regret bounds scaling not as the
w...
The best arm identification problem in the multi-armed bandit setting is...
We consider interactive learning in the realizable setting and develop a...
The level set estimation problem seeks to find all points in a domain X ...
This work considers the problem of selective-sampling for best-arm
ident...
The theory of reinforcement learning has focused on two fundamental prob...
We conduct theoretical studies on streaming-based active learning for bi...
We consider active learning for binary classification in the agnostic
po...
In recent years methods from optimal linear experimental design have bee...
We study episodic reinforcement learning under unknown adversarial
corru...
Exploration in unknown environments is a fundamental problem in reinforc...
Autonomous robot-assisted feeding requires the ability to acquire a wide...
In this paper we propose a novel experimental design-based algorithm to
...
This work proposes a procedure for designing algorithms for specific ada...
In many scientific settings there is a need for adaptive experimental de...
This paper proposes near-optimal algorithms for the pure-exploration lin...
We study the problem of estimating the distribution of effect sizes (the...
We propose an algorithm to actively estimate the parameters of a linear
...
Data scientists have relied on samples to analyze populations of interes...
In this paper we introduce the transductive linear bandit problem: given...
We consider two multi-armed bandit problems with n arms: (i) given an
ϵ ...
This paper establishes that optimistic algorithms attain gap-dependent a...
Machine learning (ML) techniques are enjoying rapidly increasing adoptio...
Hyperparameter tuning of multi-stage pipelines introduces a significant
...
This paper considers a multi-armed bandit game where the number of arms ...
Modern learning models are characterized by large hyperparameter spaces....
We propose an adaptive sampling approach for multiple testing which aims...
In this paper, we introduce the first principled adaptive-sampling proce...
We propose an alternative framework to existing setups for controlling f...
We propose a novel technique for analyzing adaptive sampling called the ...
The goal of ordinal embedding is to represent items as points in a
low-d...
Performance of machine learning algorithms depends critically on identif...
This paper studies the Best-of-K Bandit game: At each time the player ch...
Motivated by the task of hyperparameter optimization, we introduce the
n...
The dueling bandit problem is a variation of the classical multi-armed b...
The paper proposes a novel upper confidence bound (UCB) procedure for
id...
Sampling from distributions to find the one with the largest mean arises...