We investigate safe multi-agent reinforcement learning, where agents see...
This work is dedicated to the algorithm design in a competitive framewor...
In this paper, we study the system identification problem for linear
dis...
We study the scalable multi-agent reinforcement learning (MARL) with gen...
This paper studies the role of over-parametrization in solving non-conve...
We study risk-sensitive reinforcement learning (RL) based on an entropic...
We study the identification of a linear time-invariant dynamical system
...
Many fundamental low-rank optimization problems, such as matrix completi...
We study convex Constrained Markov Decision Processes (CMDPs) in which t...
We consider primal-dual-based reinforcement learning (RL) in episodic
co...
It is well-known that the Burer-Monteiro (B-M) factorization approach ca...
Entropy regularization is an efficient technique for encouraging explora...
Policy gradient (PG) methods are popular and efficient for large-scale
r...
We study entropy-regularized constrained Markov decision processes (CMDP...
In this paper, we study a general low-rank matrix recovery problem with
...
The paper studies the performance of the Model-Agnostic Meta-Learning (M...
The operation of power grids is becoming increasingly data-centric. Whil...
Nonconvex matrix recovery is known to contain no spurious local minima u...
We investigate the important problem of certifying stability of reinforc...
When the linear measurements of an instance of low-rank matrix recovery
...