-
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization
We study the Safe Reinforcement Learning (SRL) problem using the Constra...
read it
-
Global exponential stability of primal-dual gradient flow dynamics based on the proximal augmented Lagrangian: A Lyapunov-based approach
For a class of nonsmooth composite optimization problems with linear equ...
read it
-
Fast multi-agent temporal-difference learning via homotopy stochastic primal-dual optimization
We consider a distributed multi-agent policy evaluation problem in reinf...
read it

Dongsheng Ding
is this you? claim profile