Know your audience: specializing grounded language models with the game of Dixit

06/16/2022
by   Aaditya K. Singh, et al.
0

Effective communication requires adapting to the idiosyncratic common ground shared with each communicative partner. We study a particularly challenging instantiation of this problem: the popular game Dixit. We formulate a round of Dixit as a multi-agent image reference game where a (trained) speaker model is rewarded for describing a target image such that one (pretrained) listener model can correctly identify it from a pool of distractors, but another listener cannot. To adapt to this setting, the speaker must exploit differences in the common ground it shares with the different listeners. We show that finetuning an attention-based adapter between a CLIP vision encoder and a large language model in this contrastive, multi-agent setting gives rise to context-dependent natural language specialization from rewards only, without direct supervision. In a series of controlled experiments, we show that the speaker can adapt according to the idiosyncratic strengths and weaknesses of various pairs of different listeners. Furthermore, we show zero-shot transfer of the speaker's specialization to unseen real-world data. Our experiments offer a step towards adaptive communication in complex multi-partner settings and highlight the interesting research challenges posed by games like Dixit. We hope that our work will inspire creative new approaches to adapting pretrained models.

READ FULL TEXT

page 24

page 25

page 26

page 27

research
11/30/2021

An implementation of the "Guess who?" game using CLIP

CLIP (Contrastive Language-Image Pretraining) is an efficient method for...
research
05/14/2020

Multi-agent Communication meets Natural Language: Synergies between Functional and Structural Language Learning

We present a method for combining multi-agent communication and traditio...
research
04/20/2020

A Practical Guide to Studying Emergent Communication through Grounded Language Games

The question of how an effective and efficient communication system can ...
research
06/13/2019

Know What You Don't Know: Modeling a Pragmatic Speaker that Refers to Objects of Unknown Categories

Zero-shot learning in Language & Vision is the task of correctly labelli...
research
08/20/2023

Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi

Cooperative Multi-agent Reinforcement Learning (MARL) algorithms with Ze...
research
10/10/2019

Modeling Conceptual Understanding in Image Reference Games

An agent who interacts with a wide population of other agents needs to b...
research
12/16/2019

Characterizing the dynamics of learning in repeated reference games

The language we use over the course of conversation changes as we establ...

Please sign up or login with your details

Forgot password? Click here to reset