Bruno Gaujal

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Jean-Yves Le Boudec
20 publications
Nicolas Gast
16 publications
Chen Yan
11 publications
Jonatha Anselmi
5 publications
Louis-Sébastien Rebuffi
2 publications
Kimang Khun
2 publications
Romain Cravic
1 publication

research

∙ 08/04/2023

Learning Optimal Admission Control in Partially Observable Queueing Networks

We present an efficient reinforcement learning algorithm that learns the...

0 Jonatha Anselmi, et al. ∙

research

∙ 02/21/2023

Reinforcement Learning in a Birth and Death Process: Breaking the Dependence on the State Space

In this paper, we revisit the regret of undiscounted reinforcement learn...

0 Jonatha Anselmi, et al. ∙

research

∙ 01/13/2023

Decentralized model-free reinforcement learning in stochastic games with average-reward objective

We propose the first model-free algorithm that achieves low regret perfo...

0 Romain Cravic, et al. ∙

research

∙ 03/10/2022

Computing Whittle (and Gittins) Index in Subcubic Time

Whittle index is a generalization of Gittins index that provides very ef...

0 Nicolas Gast, et al. ∙

research

∙ 06/16/2021

Reinforcement Learning for Markovian Bandits: Is Posterior Sampling more Scalable than Optimism?

We study learning algorithms for the classical Markovian bandit problem ...

0 Nicolas Gast, et al. ∙

research

∙ 12/16/2020

Exponential Convergence Rate for the Asymptotic Optimality of Whittle Index Policy

We evaluate the performance of Whittle index policy for restless Markovi...

0 Nicolas Gast, et al. ∙

research

∙ 04/14/2010

Mean field for Markov Decision Processes: from Discrete to Continuous Optimization

We study the convergence of Markov Decision Processes made of a large nu...

0 Nicolas Gast, et al. ∙

Success!

An error occurred

Bruno Gaujal

Featured Co-authors

Learning Optimal Admission Control in Partially Observable Queueing Networks

Reinforcement Learning in a Birth and Death Process: Breaking the Dependence on the State Space

Decentralized model-free reinforcement learning in stochastic games with average-reward objective

Computing Whittle (and Gittins) Index in Subcubic Time

Reinforcement Learning for Markovian Bandits: Is Posterior Sampling more Scalable than Optimism?

Exponential Convergence Rate for the Asymptotic Optimality of Whittle Index Policy

Mean field for Markov Decision Processes: from Discrete to Continuous Optimization

Sign in with Google

Consider DeepAI Pro