Effects of Naturalistic Variation in Goal-Oriented Dialog

10/05/2020
by Jatin Ganhotra, et al.

Existing benchmarks used to evaluate the performance of end-to-end neural dialog systems lack a key component: the natural variation present in human conversations. Most datasets are constructed through crowdsourcing, where crowd workers follow a fixed template of instructions while enacting the role of a user or agent. This results in straightforward, somewhat routine, and mostly trouble-free conversations, as crowd workers do not think to represent the full range of actions that occur naturally with real users. In this work, we investigate the impact of naturalistic variation on two goal-oriented datasets: the bAbI dialog task and the Stanford Multi-Domain Dataset (SMD). We also propose new and more effective testbeds for both datasets by introducing naturalistic variation on the user side. We observe a significant drop in performance (more than 60% in Ent. F1 on SMD and 85% in per-dialog accuracy on the bAbI task) for recent state-of-the-art end-to-end neural methods such as BossNet and GLMP on both datasets.

research
10/20/2020

Simulated Chats for Task-oriented Dialog: Learning to Generate Conversations from Instructions

Popular task-oriented dialog data sets such as MultiWOZ (Budzianowski et...
research
08/24/2018

Learning End-to-End Goal-Oriented Dialog with Multiple Answers

In a dialog, there can be multiple valid next utterances at any point. T...
research
09/01/2019

Taskmaster-1: Toward a Realistic and Diverse Dialog Dataset

A significant barrier to progress in data-driven approaches to building ...
research
08/16/2022

TexPrax: A Messaging Application for Ethical, Real-time Data Collection and Annotation

Collecting and annotating task-oriented dialog data is difficult, especi...
research
12/10/2018

Chat-crowd: A Dialog-based Platform for Visual Layout Composition

In this paper we introduce Chat-crowd, an interactive environment for vi...
research
05/15/2020

Is Your Goal-Oriented Dialog Model Performing Really Well? Empirical Analysis of System-wise Evaluation

There is a growing interest in developing goal-oriented dialog systems w...
research
12/23/2020

TicketTalk: Toward human-level performance with end-to-end, transaction-based dialog systems

We present a data-driven, end-to-end approach to transaction-based dialo...
