Pragmatically Appropriate Diversity for Dialogue Evaluation

04/06/2023
by Katherine Stasaski, et al.

Linguistic pragmatics holds that the speech acts underlying a conversation constrain which types of response are appropriate at each turn. Neural dialogue agents, meanwhile, struggle to produce diverse responses. Dialogue diversity is currently assessed with automatic metrics, but these metrics are not informed by the underlying speech acts. To remedy this, we propose the notion of Pragmatically Appropriate Diversity, defined as the extent to which a conversation creates and constrains the creation of multiple diverse responses. Using a human-created multi-response dataset, we find significant support for the hypothesis that speech acts provide a signal for the diversity of the set of next responses. Building on this result, we propose a new human evaluation task in which creative writers predict the extent to which conversations inspire the creation of multiple diverse responses. Our studies find that writers' judgments align with the Pragmatically Appropriate Diversity of conversations. Our work suggests that expectations for diversity metric scores should vary depending on the speech act.
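To make the final point concrete, the sketch below groups an automatic diversity score by speech act. It is only an illustration under stated assumptions: the abstract does not name a specific metric, so Distinct-n (the ratio of unique n-grams to total n-grams) is used here as one commonly cited example, and the function names, whitespace tokenization, and speech act labels are all hypothetical rather than taken from the paper.

```python
from collections import defaultdict


def distinct_n(responses, n=2):
    """Ratio of unique n-grams to total n-grams over a set of responses.

    Assumption: simple lowercased whitespace tokenization.
    Returns 0.0 when the responses contain no n-grams.
    """
    total = 0
    unique = set()
    for response in responses:
        tokens = response.lower().split()
        for i in range(len(tokens) - n + 1):
            unique.add(tuple(tokens[i:i + n]))
            total += 1
    return len(unique) / total if total else 0.0


def distinct_n_by_speech_act(examples, n=2):
    """Average Distinct-n per speech act.

    `examples` is an iterable of (speech_act_label, list_of_responses)
    pairs, one pair per conversation. The labels are hypothetical.
    """
    scores = defaultdict(list)
    for act, responses in examples:
        scores[act].append(distinct_n(responses, n))
    return {act: sum(vals) / len(vals) for act, vals in scores.items()}


if __name__ == "__main__":
    # Toy data for illustration only; not drawn from the paper's dataset.
    toy_examples = [
        ("open_question", ["i like hiking", "mostly i read books"]),
        ("yes_no_question", ["yes i do", "no i do not"]),
    ]
    print(distinct_n_by_speech_act(toy_examples, n=2))
```

Reporting the score per speech act, rather than as a single corpus-level number, is one simple way to avoid comparing response sets whose expected diversity differs.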


