Measuring the `I don't know' Problem through the Lens of Gricean Quantity

10/24/2020
by   Huda Khayrallah, et al.
0

We consider the intrinsic evaluation of neural generative dialog models through the lens of Grices Maxims of Conversation (1975). Based on the maxim of Quantity (be informative), we propose Relative Utterance Quantity (RUQ) to diagnose the `I don't know' problem. The RUQ diagnostic compares the model score of a generic response to that of the reference response. We find that for reasonable baseline models, `I don't know' is preferred over the reference more than half the time, but this can be mitigated with hyperparameter tuning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/30/2019

Towards Coherent and Engaging Spoken Dialog Response Generation Using Automatic Conversation Evaluators

Encoder-decoder based neural architectures serve as the basis of state-o...
research
09/10/2020

Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection

In this paper, we study the task of selecting optimal response given use...
research
08/15/2019

A Multi-Turn Emotionally Engaging Dialog Model

Open-domain dialog systems (also known as chatbots) have increasingly dr...
research
07/19/2023

Selection functions of strong lens finding neural networks

Convolution Neural Networks trained for the task of lens finding with si...
research
09/05/2018

Neural MultiVoice Models for Expressing Novel Personalities in Dialog

Natural language generators for task-oriented dialog should be able to v...
research
12/21/2020

A Graph Reasoning Network for Multi-turn Response Selection via Customized Pre-training

We investigate response selection for multi-turn conversation in retriev...
research
10/05/2020

Non-trivial informational closure of a Bayesian hyperparameter

We investigate the non-trivial informational closure (NTIC) of a Bayesia...

Please sign up or login with your details

Forgot password? Click here to reset