Multi-Advisor Reinforcement Learning

04/03/2017
by   Romain Laroche, et al.
0

We consider tackling a single-agent RL problem by distributing it to n learners. These learners, called advisors, endeavour to solve the problem from a different focus. Their advice, taking the form of action values, is then communicated to an aggregator, which is in control of the system. We show that the local planning method for the advisors is critical and that none of the ones found in the literature is flawless: the egocentric planning overestimates values of states where the other advisors disagree, and the agnostic planning is inefficient around danger zones. We introduce a novel approach called empathic and discuss its theoretical aspects. We empirically examine and validate our theoretical findings on a fruit collection task.

READ FULL TEXT
research
04/26/2018

Action Categorization for Computationally Improved Task Learning and Planning

This paper explores the problem of task learning and planning, contribut...
research
11/26/2019

Control-Tutored Reinforcement Learning: an application to the Herding Problem

In this extended abstract we introduce a novel control-tutored Q-learni...
research
05/20/2019

Perceptual Values from Observation

Imitation by observation is an approach for learning from expert demonst...
research
12/30/2022

POMRL: No-Regret Learning-to-Plan with Increasing Horizons

We study the problem of planning under model uncertainty in an online me...
research
06/16/2022

Understanding Decision-Time vs. Background Planning in Model-Based Reinforcement Learning

In model-based reinforcement learning, an agent can leverage a learned m...
research
05/03/2019

Meta-learners' learning dynamics are unlike learners'

Meta-learning is a tool that allows us to build sample-efficient learnin...
research
10/18/2021

Goal Agnostic Planning using Maximum Likelihood Paths in Hypergraph World Models

In this paper, we present a hypergraph–based machine learning algorithm,...

Please sign up or login with your details

Forgot password? Click here to reset