On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark

10/16/2021
by   Hao Sun, et al.
0

Dialogue safety problems severely limit the real-world deployment of neural conversational models and attract great research interests recently. We propose a taxonomy for dialogue safety specifically designed to capture unsafe behaviors that are unique in human-bot dialogue setting, with focuses on context-sensitive unsafety, which is under-explored in prior works. To spur research in this direction, we compile DiaSafety, a dataset of 6 unsafe categories with rich context-sensitive unsafe examples. Experiments show that existing utterance-level safety guarding tools fail catastrophically on our dataset. As a remedy, we train a context-level dialogue safety classifier to provide a strong baseline for context-sensitive dialogue unsafety detection. With our classifier, we perform safety evaluations on popular conversational models and show that existing dialogue systems are still stuck in context-sensitive safety problems.

READ FULL TEXT

page 9

page 15

research
10/25/2019

Measuring Conversational Fluidity in Automated Dialogue Agents

We present an automated evaluation method to measure fluidity in convers...
research
12/16/2022

Werewolf Among Us: A Multimodal Dataset for Modeling Persuasion Behaviors in Social Deduction Games

Persuasion modeling is a key building block for conversational agents. E...
research
12/06/2022

Sources of Noise in Dialogue and How to Deal with Them

Training dialogue systems often entails dealing with noisy training exam...
research
10/14/2020

Recipes for Safety in Open-domain Chatbots

Models trained on large unlabeled corpora of human interactions will lea...
research
05/04/2023

A Survey on Proactive Dialogue Systems: Problems, Methods, and Prospects

Proactive dialogue systems, related to a wide range of real-world conver...
research
08/21/2020

Detecting and Classifying Malevolent Dialogue Responses: Taxonomy, Data and Methodology

Conversational interfaces are increasingly popular as a way of connectin...
research
07/03/2022

DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech

The majority of current TTS datasets, which are collections of individua...

Please sign up or login with your details

Forgot password? Click here to reset