We present the OMG-CMDP! algorithm for regret minimization in adversaria...
We present the UC^3RL algorithm for regret minimization in Stochastic Co...
We consider the problem of controlling an unknown linear dynamical syste...
We consider the problem of controlling an unknown linear dynamical syste...
We consider the task of learning to control a linear dynamical system un...
We identify a fundamental phenomenon of heterogeneous one-dimensional ra...
We consider the problem of controlling a known linear dynamical system u...
We consider the problem of learning in Linear Quadratic Control systems ...
Different risk-related criteria have received recent interest in learnin...