Countering Language Drift with Seeded Iterated Learning

03/28/2020
by Yuchen Lu, et al.

Supervised learning methods excel at capturing statistical properties of language when trained over large text corpora. Yet these models often produce inconsistent outputs in goal-oriented language settings, as they are not trained to complete the underlying task. Moreover, as soon as the agents are fine-tuned to maximize task completion, they suffer from the so-called language drift phenomenon: they slowly lose syntactic and semantic properties of language as they focus only on solving the task. In this paper, we propose a generic approach to counter language drift by using iterated learning. We iterate between fine-tuning agents with interactive training steps and periodically replacing them with new agents that are seeded from the last iteration and trained to imitate the latest fine-tuned models. Iterated learning requires neither external syntactic constraints nor semantic knowledge, making it a valuable task-agnostic fine-tuning protocol. We first explore iterated learning in the Lewis Game. We then scale up the approach in the translation game. In both settings, our results show that iterated learning drastically counters language drift and also improves the task completion metric.
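
The alternation described in the abstract can be sketched as a short training loop. This is only one reading of the abstract, not the authors' implementation: the function name `seeded_iterated_learning`, the callables `interactive_step` and `imitation_step`, and the step counts are all hypothetical placeholders, and the choice to initialize each teacher as a copy of the current student is an assumption about what "seeded from the last iteration" means.

```python
import copy
from typing import Callable, TypeVar

Agent = TypeVar("Agent")


def seeded_iterated_learning(
    seed_agent: Agent,
    interactive_step: Callable[[Agent], Agent],       # hypothetical: one task-reward update
    imitation_step: Callable[[Agent, Agent], Agent],  # hypothetical: one imitation update
    num_generations: int,
    interactive_steps: int,
    imitation_steps: int,
) -> Agent:
    """Alternate interactive fine-tuning with imitation by a seeded student."""
    # Assumption: the first student is a copy of a pretrained (supervised) agent.
    student = copy.deepcopy(seed_agent)
    for _ in range(num_generations):
        # Interactive phase: a copy of the current student is fine-tuned
        # to maximize task completion (this is where drift can appear).
        teacher = copy.deepcopy(student)
        for _ in range(interactive_steps):
            teacher = interactive_step(teacher)
        # Imitation phase: the student carried over from the last iteration
        # is trained to imitate the latest fine-tuned model.
        for _ in range(imitation_steps):
            student = imitation_step(student, teacher)
    return student
```

Passing the two update rules in as callables keeps the loop task-agnostic, in the spirit of the abstract: nothing in it refers to syntax, semantics, or a particular game.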

research
10/06/2020

Supervised Seeded Iterated Learning for Interactive Language Learning

Language drift has been one of the major obstacles to train language mod...
research
04/15/2021

Multitasking Inhibits Semantic Drift

When intelligent agents communicate to accomplish shared goals, how do t...
research
09/10/2019

Countering Language Drift via Visual Grounding

Emergent multi-agent communication protocols are very different from nat...
research
02/23/2022

Using natural language prompts for machine translation

We explore the use of natural language prompts for controlling various a...
research
04/03/2022

A Computational Analysis of Pitch Drift in Unaccompanied Solo Singing using DBSCAN Clustering

Unaccompanied vocalists usually change the tuning unintentionally and en...
research
05/26/2023

Characterizing and Measuring Linguistic Dataset Drift

NLP models often degrade in performance when real world data distributio...
research
03/23/2022

ThingTalk: An Extensible, Executable Representation Language for Task-Oriented Dialogues

Task-oriented conversational agents rely on semantic parsers to translat...
