
Safely Learning to Control the Constrained Linear Quadratic Regulator
We study the constrained linear quadratic regulator with unknown dynamic...
Certainty Equivalent Control of LQR is Efficient
We study the performance of the certainty equivalent controller on the L...
The Gap Between Model-Based and Model-Free Methods on the Linear Quadratic Regulator: An Asymptotic Viewpoint
The effectiveness of model-based versus model-free methods is a longsta...
Regret Bounds for Robust Adaptive Control of the Linear Quadratic Regulator
We consider adaptive control of the Linear Quadratic Regulator (LQR), wh...
From self-tuning regulators to reinforcement learning and back again
Machine and reinforcement learning (RL) are being applied to plan and co...
Learning Control Barrier Functions from Expert Demonstrations
Inspired by the success of imitation and inverse reinforcement learning ...
On the Sample Complexity of the Linear Quadratic Regulator
This paper addresses the optimal control problem known as the Linear Qua...
CYCLADES: Conflict-free Asynchronous Machine Learning
We present CYCLADES, a general framework for parallelizing stochastic op...
Large Scale Kernel Learning using Block Coordinate Descent
We demonstrate that distributed block coordinate descent can quickly sol...
Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator
Reinforcement learning (RL) has been successfully used to solve many con...
Learning Without Mixing: Towards A Sharp Analysis of Linear System Identification
We prove that the ordinary least-squares (OLS) estimator attains nearly ...
Learning Contracting Vector Fields For Stable Imitation Learning
We propose a new non-parametric framework for learning incrementally sta...
Minimax Lower Bounds for H_∞-Norm Estimation
The problem of estimating the H_∞-norm of an LTI system from noisy input...
Finite-time Analysis of Approximate Policy Iteration for the Linear Quadratic Regulator
We study the sample complexity of approximate policy iteration (PI) for ...
A Tutorial on Concentration Bounds for System Identification
We provide a brief tutorial on the use of concentration inequalities as ...
Stephen Tu
