Stylistic Dialogue Generation via Information-Guided Reinforcement Learning Strategy

04/05/2020
by Yixuan Su, et al.

Stylistic response generation is crucial for building an engaging dialogue system for industrial use. While it has attracted much research interest, existing methods often generate stylistic responses at the cost of content quality (relevance and fluency). To enable a better balance between content quality and style, we introduce a new training strategy, known as Information-Guided Reinforcement Learning (IG-RL). In IG-RL, the model being trained is encouraged to explore stylistic expressions while being constrained to maintain its content quality. This is achieved by adopting a reinforcement learning strategy with statistical style information guidance for quality-preserving explorations. Experiments on two datasets show that the proposed approach outperforms several strong baselines in terms of overall response performance.

04/05/2020

Prototype-to-Style: Dialogue Generation with Style-Aware Editing on Retrieval Memory

The ability of a dialog system to express prespecified language style du...
05/08/2018

Polite Dialogue Generation Without Parallel Data

Stylistic dialogue response generation, with valuable applications in pe...
06/10/2021

Synthesizing Adversarial Negative Responses for Robust Response Ranking and Evaluation

Open-domain neural dialogue models have achieved high performance in res...
04/18/2022

CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning

Conventionally, generation of natural language for dialogue agents may b...
05/19/2022

Target-Guided Dialogue Response Generation Using Commonsense and Data Augmentation

Target-guided response generation enables dialogue systems to smoothly t...
05/24/2020

GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning

A chatbot that converses like a human should be goal-oriented (i.e., be ...
09/12/2021

Stylistic Retrieval-based Dialogue System with Unparallel Training Data

The ability of a dialog system to express consistent language style duri...