Scalable pragmatic communication via self-supervision

08/12/2021
by   Jennifer Hu, et al.
3

Models of context-sensitive communication often use the Rational Speech Act framework (RSA; Frank Goodman, 2012), which formulates listeners and speakers in a cooperative reasoning process. However, the standard RSA formulation can only be applied to small domains, and large-scale applications have relied on imitating human behavior. Here, we propose a new approach to scalable pragmatics, building upon recent theoretical results (Zaslavsky et al., 2020) that characterize pragmatic reasoning in terms of general information-theoretic principles. Specifically, we propose an architecture and learning process in which agents acquire pragmatic policies via self-supervision instead of imitating human data. This work suggests a new principled approach for equipping artificial agents with pragmatic skills via self-supervision, which is grounded both in pragmatic theory and in information theory.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/13/2020

A Rate-Distortion view of human pragmatic reasoning

What computational principles underlie human pragmatic reasoning? A prom...
research
05/17/2023

Pragmatic Reasoning in Structured Signaling Games

In this work we introduce a structured signaling game, an extension of t...
research
05/31/2020

Learning to refer informatively by amortizing pragmatic reasoning

A hallmark of human language is the ability to effectively and efficient...
research
05/20/2021

A practical introduction to the Rational Speech Act modeling framework

Recent advances in computational cognitive science (i.e., simulation-bas...
research
05/16/2017

Cooperative Learning with Visual Attributes

Learning paradigms involving varying levels of supervision have received...
research
10/28/2011

Anthropic decision theory

This paper sets out to resolve how agents ought to act in the Sleeping B...
research
09/13/2016

Self-Sustaining Iterated Learning

An important result from psycholinguistics (Griffiths & Kalish, 2005) st...

Please sign up or login with your details

Forgot password? Click here to reset