Topic-based Evaluation for Conversational Bots

01/11/2018
by   Fenfei Guo, et al.
0

Dialog evaluation is a challenging problem, especially for non task-oriented dialogs where conversational success is not well-defined. We propose to evaluate dialog quality using topic-based metrics that describe the ability of a conversational bot to sustain coherent and engaging conversations on a topic, and the diversity of topics that a bot can handle. To detect conversation topics per utterance, we adopt Deep Average Networks (DAN) and train a topic classifier on a variety of question and query data categorized into multiple topics. We propose a novel extension to DAN by adding a topic-word attention table that allows the system to jointly capture topic keywords in an utterance and perform topic classification. We compare our proposed topic based metrics with the ratings provided by users and show that our metrics both correlate with and complement human judgment. Our analysis is performed on tens of thousands of real human-bot dialogs from the Alexa Prize competition and highlights user expectations for conversational bots.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/18/2018

Contextual Topic Modeling For Dialog Systems

Accurate prediction of conversation topics can be a valuable signal for ...
research
02/04/2014

Topic Segmentation and Labeling in Asynchronous Conversations

Topic segmentation and labeling is often considered a prerequisite for h...
research
01/16/2023

Distinguish Sense from Nonsense: Out-of-Scope Detection for Virtual Assistants

Out of Scope (OOS) detection in Conversational AI solutions enables a ch...
research
04/29/2020

Topic Propagation in Conversational Search

In a conversational context, a user expresses her multi-faceted informat...
research
09/09/2021

TIAGE: A Benchmark for Topic-Shift Aware Dialog Modeling

Human conversations naturally evolve around different topics and fluentl...
research
05/28/2020

Would you Like to Talk about Sports Now? Towards Contextual Topic Suggestion for Open-Domain Conversational Agents

To hold a true conversation, an intelligent agent should be able to occa...
research
04/04/2019

Topic Spotting using Hierarchical Networks with Self Attention

Success of deep learning techniques have renewed the interest in develop...

Please sign up or login with your details

Forgot password? Click here to reset