In this work, we study the concentration behavior of a stochastic
approx...
The current reinforcement learning algorithm uses forward-generated
traj...
Inspired by quantum switches, we consider a discrete-time multi-way matc...
In this work, we study policy-based methods for solving the reinforcemen...
Since reinforcement learning algorithms are notoriously data-intensive, ...
Q-learning with function approximation is one of the most empirically
su...
Stochastic approximation (SA) and stochastic gradient descent (SGD)
algo...
Parallel server system is a stochastic processing network widely studied...
In temporal difference (TD) learning, off-policy sampling is known to be...
In this paper, we develop a novel variant of off-policy natural actor-cr...
We study optimal service pricing in server farms where customers arrive
...
Markov Decision Processes are classically solved using Value Iteration a...
In this paper, we provide finite-sample convergence guarantees for an
of...
This paper develops an unified framework to study finite-sample converge...
Actor-critic style two-time-scale algorithms are very popular in
reinfor...
Motivated by applications in data center networks, in this paper, we stu...
Stochastic Approximation (SA) is a popular approach for solving fixed po...
Cryptocurrency networks such as Bitcoin have emerged as a distributed
al...
We study the policy evaluation problem in multi-agent reinforcement lear...
In this paper, we consider the model-free reinforcement learning problem...
Parallel iterative maximal matching algorithms (adapted for switching) h...
We consider a connection-level model proposed by Massoulié and Roberts
f...
We consider an input queued switch operating under the MaxWeight schedul...