A major challenge in reinforcement learning is to develop practical,
sam...
This paper presents new projection-free algorithms for Online Convex
Opt...
We develop the concept of exponential stochastic inequality (ESI), a nov...
We study the design of sample-efficient algorithms for reinforcement lea...
The aim of this paper is to design computationally-efficient and optimal...
In this paper, we leverage the rapid advances in imitation learning, a t...
In this paper, we develop new efficient projection-free algorithms for O...
We revisit the classic online portfolio selection problem, where at each...
In constrained convex optimization, existing methods based on the ellips...
Acquisition of data is a difficult task in many applications of machine
...
We introduce a new problem setting for continuous control called the LQR...
Conditional Value at Risk (CVaR) is a family of "coherent risk measures"...
We study Online Convex Optimization in the unbounded setting where neith...
We present a new PAC-Bayesian generalization bound. Standard bounds cont...
We aim to design adaptive online learning algorithms that take advantage...
We consider the setting of prediction with expert advice; a learner make...
We consider the setting of prediction with expert advice; a learner make...
Simulation-based training (SBT) is gaining popularity as a low-cost and
...