Communication lays the foundation for cooperation in human society and i...
The congestion game is a powerful model that encompasses a range of
engi...
The difficulty of appropriately assigning credit is particularly heighte...
Reinforcement learning (RL) mimics how humans and animals interact with ...
Enhancing the diversity of policies is beneficial for robustness,
explor...
Learning Markov decision processes (MDP) in an adversarial environment h...
Offline multi-agent reinforcement learning (MARL) aims to learn effectiv...
Reinforcement learning (RL) has exceeded human performance in many synth...
The phenomenon of data distribution evolving over time has been observed...
We study the convergence of the actor-critic algorithm with nonlinear
fu...
Temporal difference (TD) learning is a widely used method to evaluate
po...
Recent studies have shown that introducing communication between agents ...
Improving the resilience of a network protects the system from natural
d...
Motivated by the common strategic activities in crowdsourcing labeling, ...
A k-submodular function is a function that given k disjoint subsets
outp...
Centralized Training with Decentralized Execution (CTDE) has been a popu...
This paper studies differential privacy (DP) and local
differential priv...
We study the problem of combinatorial multi-armed bandits (CMAB) under
s...
Let F be a multivariate function from a product set Σ^n to an
Abelian gr...
We analyze the Gambler's problem, a simple reinforcement learning proble...
Binary determination of the presence of objects is one of the problems w...
We consider privacy-preserving algorithms for deep reinforcement learnin...
In recent years, reinforcement learning (RL) methods have been applied t...
Reinforcement learning (RL) has had many successes in both "deep" and
"s...
Policy optimization on high-dimensional continuous control tasks exhibit...