Counterfactual Off-Policy Training for Neural Response Generation

04/29/2020
by Qingfu Zhu, et al.

Training a neural response generation model on data synthesized under the adversarial training framework helps it explore more possible responses. However, most data synthesized de novo are of low quality because the response space is vast. In this paper, we propose a counterfactual off-policy method that learns from better synthesized data. It uses a real response to infer, via a structural causal model, an alternative response that was not taken. Learning on the counterfactual responses helps to explore the high-reward area of the response space. An empirical study on the DailyDialog dataset shows that our approach significantly outperforms the HRED model as well as the conventional adversarial training approaches.
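
The abstract does not spell out how the structural causal model yields an alternative response. As a rough illustration only, the sketch below shows one common way to do counterfactual inference over a single categorical choice, the Gumbel-max formulation: sample noise that is consistent with the observed choice under the behavior policy, then replay that same noise under a different policy. The Gumbel-max choice, the function names (`sample_gumbel`, `posterior_noise`, `counterfactual_choice`), and the toy probabilities are all assumptions made for this example and are not taken from the source.

```python
import math
import random


def sample_gumbel(rng, loc=0.0):
    """Draw one sample from a Gumbel distribution with the given location."""
    u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)  # keep both logs finite
    return loc - math.log(-math.log(u))


def posterior_noise(log_p, observed, rng):
    """Sample Gumbel noise under which `observed` is the argmax for log_p.

    Assumption: log_p holds normalized, finite log-probabilities.
    """
    top = sample_gumbel(rng)  # value of the winning perturbed logit
    noise = []
    for i, lp in enumerate(log_p):
        if i == observed:
            perturbed = top
        else:
            # Gumbel(lp) sample truncated so that it stays below `top`.
            g = sample_gumbel(rng, lp)
            perturbed = -math.log(math.exp(-top) + math.exp(-g))
        noise.append(perturbed - lp)
    return noise


def counterfactual_choice(log_p, log_q, observed, rng):
    """Replay the inferred noise under an alternative policy log_q."""
    noise = posterior_noise(log_p, observed, rng)
    scores = [lq + n for lq, n in zip(log_q, noise)]
    return max(range(len(scores)), key=scores.__getitem__)


if __name__ == "__main__":
    rng = random.Random(0)
    log_p = [math.log(x) for x in (0.5, 0.3, 0.2)]  # hypothetical behavior policy
    log_q = [math.log(x) for x in (0.1, 0.1, 0.8)]  # hypothetical target policy
    print(counterfactual_choice(log_p, log_q, observed=1, rng=rng))
```

When the alternative policy equals the behavior policy, the replayed noise reproduces the observed choice, which is the consistency property a counterfactual should have. A response-level version would presumably apply such a step at each decoding position, but that is not specified in the abstract.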


research
09/12/2018

Retrieval-Enhanced Adversarial Training for Neural Response Generation

Dialogue systems are usually built on either generation-based or retriev...
research
05/31/2021

Learning from Perturbations: Diverse and Informative Dialogue Generation with Inverse Adversarial Training

In this paper, we propose Inverse Adversarial Training (IAT) algorithm f...
research
04/30/2020

Boosting Naturalness of Language in Task-oriented Dialogues via Adversarial Training

The natural language generation (NLG) module in a task-oriented dialogue...
research
02/19/2019

A novel repetition normalized adversarial reward for headline generation

While reinforcement learning can effectively improve language generation...
research
04/26/2021

Auto Response Generation in Online Medical Chat Services

Telehealth helps to facilitate access to medical professionals by enabli...
research
09/16/2018

Generating Informative and Diverse Conversational Responses via Adversarial Information Maximization

Responses generated by neural conversational models tend to lack informa...
research
04/05/2019

Generate, Filter, and Rank: Grammaticality Classification for Production-Ready NLG Systems

Neural approaches to Natural Language Generation (NLG) have been promisi...
