Improving Dialogue Breakdown Detection with Semi-Supervised Learning

10/30/2020
by Nathan Ng, et al.

Building user trust in dialogue agents requires smooth and consistent dialogue exchanges. However, agents can easily lose conversational context and generate irrelevant utterances. Such situations, in which an agent's utterance prevents the user from continuing the conversation, are called dialogue breakdowns. Systems that detect dialogue breakdown allow agents to recover appropriately or to avoid breakdown entirely. In this paper, we investigate semi-supervised learning methods for improving dialogue breakdown detection, including continued pre-training on the Reddit dataset and a manifold-based data augmentation method. We demonstrate the effectiveness of these methods on the Dialogue Breakdown Detection Challenge (DBDC) English shared task. Our submissions to the 2020 DBDC5 shared task placed first, beating baselines and other submissions by over 12% accuracy. In ablations on DBDC4 data from 2019, our semi-supervised learning methods improve the accuracy of a baseline BERT model by 2%. These methods are generally applicable to any dialogue task and provide a simple way to improve model performance.
