A Model-Agnostic Data Manipulation Method for Persona-based Dialogue Generation

04/21/2022
by   Yu Cao, et al.

To build intelligent dialogue agents, there has been growing interest in introducing explicit personas into generation models. However, with only limited persona-based dialogue data available, it can be difficult to train a dialogue generation model well. We point out that the data challenges of this generation task lie in two aspects: first, it is expensive to scale up current persona-based dialogue datasets; second, each data sample in this task is more complex to learn from than conventional dialogue data. To alleviate these data issues, we propose a data manipulation method that is model-agnostic and can be combined with any persona-based dialogue generation model to improve its performance. The original training samples are first distilled, so that they are expected to be easier to fit. Next, we show various effective ways to diversify this easier distilled data. A given base model is then trained on the constructed data curriculum, i.e., first on the augmented distilled samples and then on the original ones. Experiments illustrate the superiority of our method with two strong base dialogue models (Transformer encoder-decoder and GPT2).
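The two-stage schedule described in the abstract (distill, diversify, then train on the easier data before the original data) might be sketched roughly as follows. This is only an illustrative outline, not the authors' implementation: the names `build_curriculum` and `train_with_curriculum`, and the `distill`, `augment`, and `train_step` callables, are hypothetical placeholders, and the choice to keep the distilled samples alongside their augmented variants in the first stage is an assumption.

```python
from typing import Callable, Iterable, List, Sequence, TypeVar

Sample = TypeVar("Sample")


def build_curriculum(
    original: Sequence[Sample],
    distill: Callable[[Sample], Sample],
    augment: Callable[[Sample], Iterable[Sample]],
) -> List[List[Sample]]:
    """Return two training stages: easier data first, original data second.

    `distill` and `augment` are hypothetical callables supplied by the user;
    the abstract does not specify how they are implemented.
    """
    # Step 1: simplify every original sample.
    distilled = [distill(sample) for sample in original]
    # Step 2: diversify the distilled samples.
    augmented = [new for sample in distilled for new in augment(sample)]
    # Assumption: stage one holds the distilled samples plus their variants.
    return [distilled + augmented, list(original)]


def train_with_curriculum(
    stages: Sequence[Sequence[Sample]],
    train_step: Callable[[Sample], object],
) -> int:
    """Feed each stage to `train_step` in order; return the number of steps."""
    steps = 0
    for stage in stages:
        for sample in stage:
            # `train_step` stands in for one update of any base model.
            train_step(sample)
            steps += 1
    return steps
```

Because the base model only appears behind `train_step`, the same schedule could in principle wrap either a Transformer encoder-decoder or GPT2, which is the sense in which the approach is model-agnostic.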


