Overcoming Referential Ambiguity in Language-Guided Goal-Conditioned Reinforcement Learning

09/26/2022
by   Hugo Caselles-Dupré, et al.
0

Teaching an agent to perform new tasks using natural language can easily be hindered by ambiguities in interpretation. When a teacher provides an instruction to a learner about an object by referring to its features, the learner can misunderstand the teacher's intentions, for instance if the instruction ambiguously refer to features of the object, a phenomenon called referential ambiguity. We study how two concepts derived from cognitive sciences can help resolve those referential ambiguities: pedagogy (selecting the right instructions) and pragmatism (learning the preferences of the other agents using inductive reasoning). We apply those ideas to a teacher/learner setup with two artificial agents on a simulated robotic task (block-stacking). We show that these concepts improve sample efficiency for training the learner.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/02/2019

Learner-aware Teaching: Inverse Reinforcement Learning with Preferences and Constraints

Inverse reinforcement learning (IRL) enables an agent to learn complex b...
research
10/21/2017

Towards Black-box Iterative Machine Teaching

In this paper, we make an important step towards the black-box machine t...
research
04/07/2023

Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following

Humans, even at a very early age, can learn visual concepts and understa...
research
06/09/2022

Pragmatically Learning from Pedagogical Demonstrations in Multi-Goal Environments

Learning from demonstration methods usually leverage close to optimal de...
research
10/17/2017

Interactively Picking Real-World Objects with Unconstrained Spoken Language Instructions

Comprehension of spoken natural language is an essential component for r...
research
09/18/2019

On the Right Path: A Modal Logic for Supervised Learning

Formal learning theory formalizes the process of inferring a general res...
research
02/07/2023

Learning Manner of Execution from Partial Corrections

Some actions must be executed in different ways depending on the context...

Please sign up or login with your details

Forgot password? Click here to reset