Large Scale Multi-Actor Generative Dialog Modeling

05/13/2020
by   Alex Boyd, et al.
0

Non-goal oriented dialog agents (i.e. chatbots) aim to produce varying and engaging conversations with a user; however, they typically exhibit either inconsistent personality across conversations or the average personality of all users. This paper addresses these issues by controlling an agent's persona upon generation via conditioning on prior conversations of a target actor. In doing so, we are able to utilize more abstract patterns within a person's speech and better emulate them in generated responses. This work introduces the Generative Conversation Control model, an augmented and fine-tuned GPT-2 language model that conditions on past reference conversations to probabilistically model multi-turn conversations in the actor's persona. We introduce an accompanying data collection procedure to obtain 10.3M conversations from 6 months worth of Reddit comments. We demonstrate that scaling model sizes from 117M to 8.3B parameters yields an improvement from 23.14 to 13.14 perplexity on 1.7M held out Reddit conversations. Increasing model scale yielded similar improvements in human evaluations that measure preference of model samples to the held out target distribution in terms of realism (31 style matching (37 conversation coherency (32 conversations improves perplexity by 0.47 in automatic evaluations. Through human trials we identify positive trends between conditional modeling and style matching and outline steps to further improve persona control.

READ FULL TEXT
research
09/20/2023

The Wizard of Curiosities: Enriching Dialogues with Fun Facts

Introducing curiosities in a conversation is a way to teach something ne...
research
10/20/2020

Local Knowledge Powered Conversational Agents

State-of-the-art conversational agents have advanced significantly in co...
research
09/22/2022

Prompting for a conversation: How to control a dialog model?

Dialog modelling faces a difficult trade-off. Models are trained on a la...
research
04/14/2023

OpenAssistant Conversations – Democratizing Large Language Model Alignment

Aligning large language models (LLMs) with human preferences has proven ...
research
02/26/2022

AugESC: Large-scale Data Augmentation for Emotional Support Conversation with Pre-trained Language Models

Crowd-sourcing is commonly adopted for dialog data collection. However, ...
research
07/13/2023

DIALGEN: Collaborative Human-LM Generated Dialogues for Improved Understanding of Human-Human Conversations

Applications that could benefit from automatic understanding of human-hu...
research
08/21/2023

Large Language Model as a User Simulator

The unparalleled performance of closed-sourced ChatGPT has sparked effor...

Please sign up or login with your details

Forgot password? Click here to reset