Hierarchical Context Tagging for Utterance Rewriting

06/22/2022
by Lisa Jin, et al.

Utterance rewriting aims to recover coreferences and omitted information from the latest turn of a multi-turn dialogue. Recently, methods that tag rather than linearly generate sequences have proven stronger in both in- and out-of-domain rewriting settings. This is due to a tagger's smaller search space, as it can only copy tokens from the dialogue context. However, these methods may suffer from low coverage when phrases that must be added to a source utterance cannot be covered by a single context span. This can occur in languages like English that introduce tokens such as prepositions into the rewrite for grammaticality. We propose a hierarchical context tagger (HCT) that mitigates this issue by predicting slotted rules (e.g., "besides _") whose slots are later filled with context spans. HCT (i) tags the source string with token-level edit actions and slotted rules and (ii) fills in the resulting rule slots with spans from the dialogue context. This rule tagging allows HCT to add out-of-context tokens and multiple spans at once; we further cluster the rules to truncate the long tail of the rule distribution. Experiments on several benchmarks show that HCT can outperform state-of-the-art rewriting systems by about 2 BLEU points.
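
The two steps described above, tagging the source with edit actions and slotted rules and then filling the slots with context spans, can be illustrated with a minimal sketch of how a rewrite might be assembled once the predictions are available. This is not the authors' implementation: the names `TokenTag`, `fill_rule`, and `rewrite` are hypothetical, the tags are written by hand rather than predicted by a model, and inserting the filled rule before the tagged token is an assumption made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# A context span as (start, end) token indices, end exclusive (assumed convention).
Span = Tuple[int, int]


@dataclass
class TokenTag:
    """Hypothetical per-token prediction for one source token."""
    keep: bool                     # token-level edit action: keep or delete
    rule: Optional[str] = None     # slotted rule such as "besides _", or None
    spans: Tuple[Span, ...] = ()   # one context span per "_" slot in the rule


def fill_rule(rule: str, spans: Tuple[Span, ...], context: List[str]) -> List[str]:
    """Replace each "_" slot in the rule with the tokens of a context span."""
    pieces = rule.split()
    if pieces.count("_") != len(spans):
        raise ValueError("number of spans must match number of slots")
    remaining = iter(spans)
    filled: List[str] = []
    for piece in pieces:
        if piece == "_":
            start, end = next(remaining)
            filled.extend(context[start:end])  # tokens copied from the context
        else:
            filled.append(piece)               # out-of-context token from the rule
    return filled


def rewrite(source: List[str], tags: List[TokenTag], context: List[str]) -> List[str]:
    """Apply the tags to the source; filled rules go before their token (assumed)."""
    output: List[str] = []
    for token, tag in zip(source, tags):
        if tag.rule is not None:
            output.extend(fill_rule(tag.rule, tag.spans, context))
        if tag.keep:
            output.append(token)
    return output


# Hand-written example: the rule adds the preposition "besides", which is
# not in the context, and its slot is filled with the context span "jazz".
context = "do you like jazz ?".split()
source = "what else do you like ?".split()
tags = [TokenTag(keep=True) for _ in source]
tags[-1] = TokenTag(keep=True, rule="besides _", spans=((3, 4),))

result = " ".join(rewrite(source, tags, context))
print(result)  # what else do you like besides jazz ?
```

A tagger that copies only single context spans could add "jazz" here but not "besides"; a rule with several slots would likewise let one tag insert multiple spans at once.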


