Jordi Grau-Moya

research

∙ 09/19/2023

Language Modeling Is Compression

It has long been established that predictive models can be transformed i...

Grégoire Delétang, et al.

research

∙ 05/26/2023

Randomized Positional Encodings Boost Length Generalization of Transformers

Transformers have impressive generalization capabilities on tasks with a...

Anian Ruoss, et al.

research

∙ 02/06/2023

Memory-Based Meta-Learning on Non-Stationary Distributions

Memory-based meta-learning is a technique for approximating Bayes-optima...

Tim Genewein, et al.

research

∙ 09/30/2022

Beyond Bayes-optimality: meta-learning what you know you don't know

Meta-training agents with memory has been shown to culminate in Bayes-op...

Jordi Grau-Moya, et al.

research

∙ 07/05/2022

Neural Networks and the Chomsky Hierarchy

Reliable generalization lies at the heart of safe ML and AI. However, un...

Grégoire Delétang, et al.

research

∙ 03/23/2022

Your Policy Regularizer is Secretly an Adversary

Policy regularization methods such as maximum entropy regularization are...

Rob Brekelmans, et al.

research

∙ 11/04/2021

Model-Free Risk-Sensitive Reinforcement Learning

We extend temporal-difference (TD) learning in order to obtain risk-sens...

Grégoire Delétang, et al.

research

∙ 10/20/2021

Shaking the foundations: delusions in sequence models for interaction and control

The recent phenomenal success of language models has reinvigorated machi...

Pedro A. Ortega, et al.

research

∙ 03/26/2021

Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow

In the past decade, model-free reinforcement learning (RL) has provided ...

John McLeod, et al.

research

∙ 03/05/2021

Causal Analysis of Agent Behavior for AI Safety

As machine learning systems become more powerful they also become increa...

Grégoire Delétang, et al.

research

∙ 09/11/2019

Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning

Cumulative entropy regularization introduces a regulatory signal to the ...

Felix Leibfried, et al.

research

∙ 07/26/2019

A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment

Empowerment is an information-theoretic method that can be used to intri...

Felix Leibfried, et al.

research

∙ 06/21/2019

Disentangled Skill Embeddings for Reinforcement Learning

We propose a novel framework for multi-task reinforcement learning (MTRL...

Janith C. Petangoda, et al.

research

∙ 02/09/2018

Balancing Two-Player Stochastic Games with Soft Q-Learning

Within the context of video games the notion of perfectly rational agent...

Jordi Grau-Moya, et al.

research

∙ 08/06/2017

An Information-Theoretic Optimality Principle for Deep Reinforcement Learning

In this paper, we methodologically address the problem of cumulative rew...

Felix Leibfried, et al.

research

∙ 04/07/2016

Planning with Information-Processing Constraints and Model Uncertainty in Markov Decision Processes

Information-theoretic principles for learning and acting have been propo...

Jordi Grau-Moya, et al.

research

∙ 11/05/2015

Adaptive information-theoretic bounded rational decision-making with parametric priors

Deviations from rational decision-making due to limited computational re...

Jordi Grau-Moya, et al.

research

∙ 12/24/2013

Bounded Rational Decision-Making in Changing Environments

A perfectly rational decision-maker chooses the best action with the hig...

Jordi Grau-Moya, et al.

research

∙ 06/09/2012

A Nonparametric Conjugate Prior Distribution for the Maximizing Argument of a Noisy Function

We propose a novel Bayesian approach to solve stochastic optimization pr...

Pedro A. Ortega, et al.
