Enhance Incomplete Utterance Restoration by Joint Learning Token Extraction and Text Generation

04/08/2022
by Shumpei Inoue, et al.

This paper introduces a model for incomplete utterance restoration (IUR). Unlike prior studies, which work only on either extraction or abstraction datasets, we design a simple but effective model that works in both IUR scenarios. Our design mirrors the nature of IUR, where tokens omitted from the utterance but present in the context contribute to restoration. Based on this, we construct a Picker that identifies the omitted tokens. To support the Picker, we design two label creation methods (soft and hard labels), which work even when the omitted tokens are not annotated. Restoration is performed by a Generator that is trained jointly with the Picker. Promising results on four benchmark datasets in extraction and abstraction scenarios show that our model outperforms the pretrained T5 and non-generative language model methods in both rich and limited training data settings. The code will also be made available.
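To make the Picker's training signal concrete, the following is a minimal sketch of one way hard labels could be derived when omitted tokens are not annotated. It is an illustration under stated assumptions, not the paper's actual procedure: the rule used here (a context token is labeled 1 if it appears in the restored utterance but not in the incomplete one) and the names `hard_labels`, `joint_loss`, and `weight` are all hypothetical.

```python
from typing import List


def hard_labels(
    context_tokens: List[str],
    utterance_tokens: List[str],
    restored_tokens: List[str],
) -> List[int]:
    """Assumed labeling rule (not taken from the paper): mark a context
    token with 1 if it occurs in the restored utterance but is absent
    from the incomplete utterance, otherwise 0."""
    utterance_vocab = set(utterance_tokens)
    # Tokens that restoration had to bring in from somewhere else.
    omitted = {tok for tok in restored_tokens if tok not in utterance_vocab}
    return [1 if tok in omitted else 0 for tok in context_tokens]


def joint_loss(generation_loss: float, picker_loss: float, weight: float = 1.0) -> float:
    """Assumed form of a joint objective: a weighted sum of the two
    losses. The weighting scheme is hypothetical."""
    return generation_loss + weight * picker_loss


# Toy dialogue: the reply omits "jazz music", which the context supplies.
context = ["do", "you", "like", "jazz", "music"]
utterance = ["yes", "i", "love", "it"]
restored = ["yes", "i", "love", "jazz", "music"]

labels = hard_labels(context, utterance, restored)
print(labels)
```

In this toy case only the context tokens "jazz" and "music" receive a positive label, since they are the tokens the restored utterance needs but the incomplete utterance lacks. A soft-label variant would presumably assign graded scores instead of binary ones; the abstract does not specify how.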


