In order for a bimanual robot to manipulate an object that is held by bo...
Visual model-based RL methods typically encode image observations into
l...
We propose a theoretical framework for studying the imitation of stochas...
Obtaining rigorous statistical guarantees for generalization under
distr...
A common pipeline in learning-based control is to iteratively estimate a...
Computing optimal, collision-free trajectories for high-dimensional syst...
Machine learning systems, especially with overparameterized deep neural
...
This paper studies the prediction of a target 𝐳 from a pair of
random va...
Smoothed online learning has emerged as a popular framework to mitigate ...
The problem of piecewise affine (PWA) regression and planning is of
foun...
Due to the drastic gap in complexity between sequential and batch statis...
We introduce the first direct policy search algorithm which provably
con...
We study online control of time-varying linear systems with unknown dyna...
Differentiable simulators promise faster computation time for reinforcem...
Reward-free reinforcement learning (RL) considers the setting where the ...
Obtaining first-order regret bounds – regret bounds scaling not as the
w...
Stabilizing an unknown control system is one of the most fundamental pro...
The theory of reinforcement learning has focused on two fundamental prob...
Thompson sampling and other Bayesian sequential decision-making algorith...
The widespread adoption of nonlinear Receding Horizon Control (RHC)
stra...
We study the problem of adaptive control of the linear quadratic regulat...
How do you incentivize self-interested agents to explore when they
prefe...
Exploration in unknown environments is a fundamental problem in reinforc...
We introduce a new problem setting for continuous control called the LQR...
Recent literature has made much progress in understanding online LQR:
a ...
We propose an algorithm for tabular episodic reinforcement learning with...
While real-world decisions involve many competing objectives, algorithmi...
We introduce a new algorithm for online linear-quadratic control in a kn...
Exploration is widely regarded as one of the most challenging aspects of...
We consider the problem of online adaptive control of the linear quadrat...
We consider the problem of controlling a possibly unknown linear dynamic...
We initiate the study of multi-stage episodic reinforcement learning und...
We investigate the computational complexity of several basic linear alge...
This paper establishes that optimistic algorithms attain gap-dependent a...
We analyze a simple prefiltered variation of the least squares estimator...
We study the adaptive sensing problem for the multiple source seeking
pr...
Much recent work on fairness in machine learning has focused on how well...
In this paper, we introduce the first principled adaptive-sampling proce...
Minimizing a convex, quadratic objective is a fundamental problem in mac...
We prove a query complexity lower bound for approximating the top r
dime...
Fairness in machine learning has predominantly been studied in static
cl...
We prove that the ordinary least-squares (OLS) estimator attains nearly
...
A common problem in machine learning is to rank a set of n items based o...
We establish that first-order methods avoid saddle points for almost all...
We prove a query complexity lower bound on rank-one principal
component ...
We propose a novel technique for analyzing adaptive sampling called the ...
This paper studies the Best-of-K Bandit game: At each time the player ch...
We show that gradient descent converges to a local minimizer, almost sur...