
Learning Zero-sum Stochastic Games with Posterior Sampling
In this paper, we propose Posterior Sampling Reinforcement Learning for ...
A relaxed technical assumption for posterior sampling-based reinforcement learning for control of unknown linear systems
We revisit the Thompson sampling algorithm to control an unknown linear ...
Scalable regret for learning to control network-coupled subsystems with unknown dynamics
We consider the problem of controlling an unknown linear quadratic Gauss...
Optimal communication and control strategies in a multi-agent MDP problem
The problem of controlling multi-agent systems under different models of...
Online Learning for Unknown Partially Observable MDPs
Solving Partially Observable Markov Decision Processes (POMDPs) is hard....
Dynamic Games among Teams with Delayed Intra-Team Information Sharing
We analyze a class of stochastic dynamic games among teams with asymmetr...
Common Information Belief based Dynamic Programs for Stochastic Zero-sum Games with Competing Teams
Decentralized team problems where players have asymmetric information ab...
Thompson sampling for linear quadratic mean-field teams
We consider optimal control of an unknown multi-agent linear quadratic (...
Optimal Dynamic Mechanism Design with Stochastic Supply and Flexible Consumers
We consider the problem of designing an expected-revenue maximizing mech...
Testing for Anomalies: Active Strategies and Non-asymptotic Analysis
The problem of verifying whether a multi-component system has anomalies ...
Regret Bounds for Decentralized Learning in Cooperative Multi-Agent Dynamical Systems
Regret analysis is challenging in Multi-Agent Reinforcement Learning (MA...
Fixed-horizon Active Hypothesis Testing
Two active hypothesis testing problems are formulated. In these problems...
Zero-sum Stochastic Games with Asymmetric Information
A general model for zero-sum stochastic games with asymmetric informatio...
Active Hypothesis Testing: Beyond Chernoff-Stein
An active hypothesis testing problem is formulated. In this problem, the...
Sequential Experiment Design for Hypothesis Verification
Hypothesis testing is an important problem with applications in target l...
Optimal Mechanism Design with Flexible Consumers and Costly Supply
The problem of designing a profit-maximizing, Bayesian incentive compati...
