
NearOptimal Regret Bounds for ModelFree RL in NonStationary Episodic MDPs
We consider modelfree reinforcement learning (RL) in nonstationary Mar...
read it

Reinforcement Learning in NonStationary DiscreteTime LinearQuadratic MeanField Games
In this paper, we study large population multiagent reinforcement learn...
read it

ModelBased MultiAgent RL in ZeroSum Markov Games with NearOptimal Sample Complexity
Modelbased reinforcement learning (RL), which finds an optimal policy u...
read it

POLYHOOT: MonteCarlo Planning in Continuous Space MDPs with NonAsymptotic Analysis
MonteCarlo planning, as exemplified by MonteCarlo Tree Search (MCTS), ...
read it

Information State Embedding in Partially Observable Cooperative MultiAgent Reinforcement Learning
Multiagent reinforcement learning (MARL) under partial observability ha...
read it

Approximate Equilibrium Computation for DiscreteTime LinearQuadratic MeanField Games
While the topic of meanfield games (MFGs) has a relatively long history...
read it

Asynchronous Policy Evaluation in Distributed Reinforcement Learning over Networks
This paper proposes a fully asynchronous scheme for policy evaluation of...
read it

Decentralized MultiAgent Reinforcement Learning with Networked Agents: Recent Advances
Multiagent reinforcement learning (MARL) has long been a significant an...
read it

MultiAgent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Recent years have witnessed significant advances in reinforcement learni...
read it

NonCooperative Inverse Reinforcement Learning
Making decisions in the presence of a strategic opponent requires one to...
read it

Policy Optimization for H_2 Linear Control with H_∞ Robustness Guarantee: Implicit Regularization and Global Convergence
Policy optimization (PO) is a key ingredient for reinforcement learning ...
read it

Online Planning for Decentralized Stochastic Control with Partial History Sharing
In decentralized stochastic control, standard approaches for sequential ...
read it

Stochastic Convergence Results for Regularized ActorCritic Methods
In this paper, we present a stochastic convergence proof, under suitable...
read it

A CommunicationEfficient MultiAgent ActorCritic Algorithm for Distributed Reinforcement Learning
This paper considers a distributed reinforcement learning problem in whi...
read it

Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies
Policy gradient (PG) methods are a widely used reinforcement learning me...
read it

Policy Optimization Provably Converges to Nash Equilibria in ZeroSum Linear Quadratic Games
We study the global convergence of policy optimization for finding the N...
read it

A MultiAgent OffPolicy ActorCritic Algorithm for Distributed Reinforcement Learning
This paper extends offpolicy reinforcement learning to the multiagent ...
read it

CommunicationEfficient Distributed Reinforcement Learning
This paper studies the distributed reinforcement learning (DRL) problem ...
read it

FiniteSample Analyses for Fully Decentralized MultiAgent Reinforcement Learning
Despite the increasing interest in multiagent reinforcement learning (M...
read it

Distributed Learning of Average Belief Over Networks Using Sequential Observations
This paper addresses the problem of distributed learning of average beli...
read it

Fully Decentralized MultiAgent Reinforcement Learning with Networked Agents
We consider the problem of fully decentralized multiagent reinforcement...
read it