
Near-optimal Bayesian Solution For Unknown Discrete Markov Decision Process
We tackle the problem of acting in an unknown finite and discrete Markov...
Near-optimal Reinforcement Learning using Bayesian Quantiles
We study model-based reinforcement learning in finite communicating Mark...
Near-Optimal Online Egalitarian learning in General Sum Repeated Matrix Games
We study two-player general sum repeated finite games where the rewards ...
Differential Privacy for Multi-armed Bandits: What Is It and What Is Its Cost?
We introduce a number of privacy definitions for the multi-armed bandit ...
Near-optimal Optimistic Reinforcement Learning using Empirical Bernstein Inequalities
We study model-based reinforcement learning in an unknown finite communi...
Algorithms for Differentially Private Multi-Armed Bandits
We present differentially private algorithms for the stochastic MultiAr...
Probabilistic inverse reinforcement learning in unknown environments
We consider the problem of learning by demonstration from agents acting ...
Aristide Tossou
