A practical challenge in reinforcement learning are combinatorial action...
Partial monitoring is an expressive framework for sequential decision-ma...
Tuning machine parameters of particle accelerators is a repetitive and
t...
We consider Bayesian optimization in settings where observations can be
...
Combinatorial bandits with semi-bandit feedback generalize multi-armed
b...
We introduce a computationally efficient algorithm for finite stochastic...
Partial monitoring is a rich framework for sequential decision making un...
Robustness to distributional shift is one of the key challenges of
conte...
We introduce a novel stochastic contextual bandit model, where at each s...
Bayesian optimization is known to be difficult to scale to high dimensio...
Efficient exploration remains a major challenge for reinforcement learni...
In the stochastic bandit problem, the goal is to maximize an unknown fun...