The aim of this paper is to improve the understanding of the optimizatio...
We study the performance of policy gradient methods for the subclass of
...
We study the problem of finding the Nash equilibrium in a two-player zer...
There are much recent interests in solving noncovnex min-max optimizatio...
We consider a discounted cost constrained Markov decision process (CMDP)...
We study a novel two-time-scale stochastic gradient method for solving
o...
We consider the problem of Byzantine fault-tolerance in federated machin...
We study the so-called two-time-scale stochastic approximation, a
simula...
Actor-critic style two-time-scale algorithms are very popular in
reinfor...
Two-time-scale stochastic approximation, a generalized version of the po...
Stochastic approximation, a data-driven approach for finding the fixed p...
This paper considers the problem of Byzantine fault-tolerance in multi-a...
Motivated by broad applications in reinforcement learning and federated
...
Motivated by broad applications in reinforcement learning and machine
le...
Motivated by their broad applications in reinforcement learning, we stud...
We study the policy evaluation problem in multi-agent reinforcement lear...
In this paper, we consider the model-free reinforcement learning problem...