Pedro Ortega | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Remi Munos
89 publications
Marcus Hutter
83 publications
Georgios Piliouras
78 publications
Karl Tuyls
50 publications
Marc Lanctot
44 publications
Shane Legg
41 publications
Mark Rowland
40 publications
Julien Perolat
34 publications
Peter Battaglia
33 publications
David Balduzzi
33 publications
Edward Hughes
30 publications

research

∙ 09/30/2022

Beyond Bayes-optimality: meta-learning what you know you don't know

Meta-training agents with memory has been shown to culminate in Bayes-op...

8 Jordi Grau-Moya, et al. ∙

research

∙ 03/23/2022

Your Policy Regularizer is Secretly an Adversary

Policy regularization methods such as maximum entropy regularization are...

0 Rob Brekelmans, et al. ∙

research

∙ 02/19/2020

From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization

In this paper we investigate the Follow the Regularized Leader dynamics ...

32 Julien Perolat, et al. ∙

research

∙ 01/23/2019

Causal Reasoning from Meta-reinforcement Learning

Discovering and exploiting the causal structure in the environment is a ...

0 Ishita Dasgupta, et al. ∙

Success!

An error occurred