DeepAI AI Chat
Log In Sign Up

Stylistic Dialogue Generation via Information-Guided Reinforcement Learning Strategy

by   Yixuan Su, et al.

Stylistic response generation is crucial for building an engaging dialogue system for industrial use. While it has attracted much research interest, existing methods often generate stylistic responses at the cost of the content quality (relevance and fluency). To enable better balance between the content quality and the style, we introduce a new training strategy, know as Information-Guided Reinforcement Learning (IG-RL). In IG-RL, a training model is encouraged to explore stylistic expressions while being constrained to maintain its content quality. This is achieved by adopting reinforcement learning strategy with statistical style information guidance for quality-preserving explorations. Experiments on two datasets show that the proposed approach outperforms several strong baselines in terms of the overall response performance.


page 1

page 2

page 3

page 4


Prototype-to-Style: Dialogue Generation with Style-Aware Editing on Retrieval Memory

The ability of a dialog system to express prespecified language style du...

Polite Dialogue Generation Without Parallel Data

Stylistic dialogue response generation, with valuable applications in pe...

Synthesizing Adversarial Negative Responses for Robust Response Ranking and Evaluation

Open-domain neural dialogue models have achieved high performance in res...

CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning

Conventionally, generation of natural language for dialogue agents may b...

Target-Guided Dialogue Response Generation Using Commonsense and Data Augmentation

Target-guided response generation enables dialogue systems to smoothly t...

GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning

A chatbot that converses like a human should be goal-oriented (i.e., be ...

Stylistic Retrieval-based Dialogue System with Unparallel Training Data

The ability of a dialog system to express consistent language style duri...