BioCopy: A Plug-And-Play Span Copy Mechanism in Seq2Seq Models

09/26/2021
by Yi Liu, et al.

Copy mechanisms explicitly obtain unchanged tokens from the source (input) sequence to generate the target (output) sequence under the neural seq2seq framework. However, most existing copy mechanisms only copy single words from the source sentence, so essential tokens are lost when copying long spans. In this work, we propose a plug-and-play architecture, BioCopy, to alleviate this problem. Specifically, in the training stage, we construct a BIO tag for each token and train the original model jointly with the BIO tags. In the inference stage, the model first predicts the BIO tag at each time step, then applies a mask strategy chosen by the predicted BIO label to narrow the probability distribution over the vocabulary. Experimental results on two separate generative tasks show that adding BioCopy to the original model structure outperforms the baseline models on both tasks.
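
The two stages described in the abstract can be illustrated with a minimal Python sketch. The abstract does not spell out the tagging or masking rules, so everything specific here is an assumption made for illustration: B marks the first token of a span copied from the source, I marks a continuation of that span, O marks a freely generated token, and the mask for I keeps only tokens that directly follow the previous token somewhere in the source. The names `bio_tags`, `allowed_tokens`, and `min_span` are hypothetical and do not come from the paper.

```python
from typing import List, Optional, Set


def bio_tags(source: List[str], target: List[str], min_span: int = 2) -> List[str]:
    """Training-stage sketch: tag each target token as B, I, or O.

    Assumption: a run of target tokens counts as "copied" when it matches
    a contiguous source span of at least `min_span` tokens (greedy longest match).
    """
    tags = ["O"] * len(target)
    i = 0
    while i < len(target):
        best = 0  # length of the longest source span matching target[i:]
        for s in range(len(source)):
            k = 0
            while (i + k < len(target) and s + k < len(source)
                   and target[i + k] == source[s + k]):
                k += 1
            best = max(best, k)
        if best >= min_span:
            tags[i] = "B"
            for j in range(i + 1, i + best):
                tags[j] = "I"
            i += best
        else:
            i += 1
    return tags


def allowed_tokens(tag: str, source: List[str], prev_token: Optional[str],
                   vocab: Set[str]) -> Set[str]:
    """Inference-stage sketch: narrow the candidate set by the predicted tag.

    Assumed mask strategies (not taken from the paper):
      B -> any token that occurs in the source
      I -> tokens that immediately follow `prev_token` in the source
      O -> the full vocabulary (no mask)
    """
    if tag == "B":
        return set(source)
    if tag == "I":
        return {source[k + 1] for k in range(len(source) - 1)
                if source[k] == prev_token}
    return set(vocab)


source = "the cat sat on the mat".split()
target = "yesterday the cat sat quietly".split()
tags = bio_tags(source, target)
```

In a real decoder the allowed set would typically be turned into a mask over the logits before the softmax, so that tokens outside the set receive zero probability.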

research
06/20/2021

Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences

Visual information extraction (VIE) has attracted increasing attention i...
research
06/08/2020

Copy that! Editing Sequences by Copying Spans

Neural sequence-to-sequence models are finding increasing use in editing...
research
07/06/2018

Sequential Copying Networks

Copying mechanism shows effectiveness in sequence-to-sequence based neur...
research
06/14/2018

Structure-Infused Copy Mechanisms for Abstractive Summarization

Seq2seq learning has produced promising results on summarization. Howeve...
research
07/19/2018

Sequence to Logic with Copy and Cache

Generating logical form equivalents of human language is a fresh way to ...
research
06/22/2022

Hierarchical Context Tagging for Utterance Rewriting

Utterance rewriting aims to recover coreferences and omitted information...
research
12/29/2019

Copy Move Source-Target Disambiguation through Multi-Branch CNNs

We propose a method to identify the source and target regions of a copy-...
