Controllable Dialogue Simulation with In-Context Learning

10/09/2022
by   Zekun Li, et al.
8

Building dialogue systems requires a large corpus of annotated dialogues. Such datasets are usually created via crowdsourcing, which is expensive and time-consuming. In this paper, we propose a novel method for dialogue simulation based on language model in-context learning, dubbed as \textsc{Dialogic}. Seeded with a few annotated dialogues, \textsc{Dialogic} automatically selects in-context examples for demonstration and prompts GPT-3 to generate new dialogues and their annotations in a controllable way. Leveraging the strong in-context learning ability of GPT-3, our method can be used to rapidly expand a small set of dialogue data without requiring \textit{human involvement} or \textit{parameter update}, and is thus much more cost-efficient and time-saving than crowdsourcing. Experimental results on the MultiWOZ dataset demonstrate that training a model on the simulated dialogues leads to even better performance than using the same amount of human-generated dialogues in the low-resource settings, with as few as 85 dialogues as the seed data. Human evaluation results also show that our simulated dialogues has high language fluency and annotation accuracy. The code and data are available at \href{https://github.com/Leezekun/dialogic}{https://github.com/Leezekun/dialogic}.

READ FULL TEXT
research
05/23/2023

Generating Data for Symbolic Language with Large Language Models

While large language models (LLMs) bring not only performance but also c...
research
03/12/2020

CRWIZ: A Framework for Crowdsourcing Real-Time Wizard-of-Oz Dialogues

Large corpora of task-based and open-domain conversational dialogues are...
research
11/05/2019

LIDA: Lightweight Interactive Dialogue Annotator

Dialogue systems have the potential to change how people interact with m...
research
11/17/2021

MEDCOD: A Medically-Accurate, Emotive, Diverse, and Controllable Dialog System

We present MEDCOD, a Medically-Accurate, Emotive, Diverse, and Controlla...
research
06/28/2022

Simplifying Dataflow Dialogue Design

In <cit.>, a dataflow (DF) based dialogue system was introduced, showing...
research
04/06/2022

The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems

Conversational agents have come increasingly closer to human competence ...
research
04/27/2015

Building Hierarchies of Concepts via Crowdsourcing

Hierarchies of concepts are useful in many applications from navigation ...

Please sign up or login with your details

Forgot password? Click here to reset