Say What I Want: Towards the Dark Side of Neural Dialogue Models

09/13/2019
by   Haochen Liu, et al.
0

Neural dialogue models have been widely adopted in various chatbot applications because of their good performance in simulating and generalizing human conversations. However, there exists a dark side of these models -- due to the vulnerability of neural networks, a neural dialogue model can be manipulated by users to say what they want, which brings in concerns about the security of practical chatbot services. In this work, we investigate whether we can craft inputs that lead a well-trained black-box neural dialogue model to generate targeted outputs. We formulate this as a reinforcement learning (RL) problem and train a Reverse Dialogue Generator which efficiently finds such inputs for targeted outputs. Experiments conducted on a representative neural dialogue model show that our proposed model is able to discover such desired inputs in a considerable portion of cases. Overall, our work reveals this weakness of neural dialogue models and may prompt further researches of developing corresponding solutions to avoid it.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/27/2020

Chat as Expected: Learning to Manipulate Black-box Neural Dialogue Models

Recently, neural network based dialogue systems have become ubiquitous i...
research
04/18/2022

CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning

Conventionally, generation of natural language for dialogue agents may b...
research
03/02/2020

Learning from Easy to Complex: Adaptive Multi-curricula Learning for Neural Dialogue Generation

Current state-of-the-art neural dialogue systems are mainly data-driven ...
research
09/11/2018

Detecting egregious responses in neural sequence-to-sequence models

In this work, we attempt to answer a critical question: whether there ex...
research
02/21/2023

Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management

Reinforcement learning (RL) has shown great promise for developing dialo...
research
12/06/2022

Sources of Noise in Dialogue and How to Deal with Them

Training dialogue systems often entails dealing with noisy training exam...

Please sign up or login with your details

Forgot password? Click here to reset