research
∙
06/05/2020
Logical Team Q-learning: An approach towards factored policies in cooperative MARL
We address the challenge of learning factored policies in cooperative MA...
research
∙
09/13/2019
ISL: Optimal Policy Learning With Optimal Exploration-Exploitation Trade-Off
Traditionally, off-policy learning algorithms (such as Q-learning) and e...
research
∙
10/17/2018
Multi-Agent Fully Decentralized Value Function Learning with Linear Convergence Rates
This work develops a fully decentralized multi-agent algorithm for polic...
research
∙
10/17/2018