Automatically Exposing Problems with Neural Dialog Models

09/14/2021
by   Dian Yu, et al.

Neural dialog models are known to suffer from problems such as generating unsafe and inconsistent responses. Even though these problems are critical and prevalent, they are mostly identified manually by model designers through interaction. Recent work instructs crowdworkers to goad bots into triggering such problems. However, humans rely on superficial cues such as hate speech, leaving systematic problems undiscovered. In this paper, we propose two methods, including one based on reinforcement learning, to automatically trigger a dialog model into generating problematic responses. We demonstrate the effectiveness of our methods in exposing safety and contradiction issues in state-of-the-art dialog models.

