Big Bidirectional Insertion Representations for Documents

10/29/2019
by   Lala Li, et al.

The Insertion Transformer is well suited for long-form text generation due to its parallel generation capabilities, requiring O(log_2 n) generation steps to generate n tokens. However, modeling long sequences is difficult, as there is more ambiguity captured in the attention mechanism. This work proposes the Big Bidirectional Insertion Representations for Documents (Big BIRD), an insertion-based model for document-level translation tasks. We scale up insertion-based models to long-form documents. Our key contribution is introducing sentence alignment via sentence-positional embeddings between the source and target document. We show an improvement of +4.3 BLEU on the WMT'19 English→German document-level translation task compared with the Insertion Transformer baseline.
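
The O(log_2 n) step count quoted above can be illustrated with a small counting sketch. It is not the authors' model or decoding code. It assumes an idealized schedule in which every open slot (each gap between tokens, plus both ends) receives exactly one token per parallel round, and the function name `parallel_insertion_steps` is hypothetical.

```python
def parallel_insertion_steps(n: int) -> int:
    """Count parallel rounds needed to reach n tokens.

    Assumption (idealized): in each round, every open slot receives
    one token, until only the remaining tokens are left to place.
    """
    length = 0  # tokens generated so far
    steps = 0
    while length < n:
        slots = length + 1                # gaps between tokens, plus both ends
        length += min(slots, n - length)  # fill every slot, capped at n
        steps += 1
    return steps


if __name__ == "__main__":
    for n in (1, 7, 8, 1000):
        # For non-negative integers, n.bit_length() equals ceil(log2(n + 1)).
        print(n, parallel_insertion_steps(n), n.bit_length())
```

Under this assumption the sequence length after k rounds is 2^k - 1, so n tokens take ceil(log_2(n + 1)) rounds. A left-to-right decoder would need n steps for the same output.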

research
06/02/2021

Hi-Transformer: Hierarchical Interactive Transformer for Efficient and Effective Long Document Modeling

Transformer is important for text modeling. However, it has difficulty i...
research
10/16/2022

Modeling Context With Linear Attention for Scalable Document-Level Translation

Document-level machine translation leverages inter-sentence dependencies...
research
04/30/2020

Exploiting Sentence Order in Document Alignment

In this work, we exploit the simple idea that a document and its transla...
research
05/16/2021

Doc2Dict: Information Extraction as Text Generation

Typically, information extraction (IE) requires a pipeline approach: fir...
research
07/30/2019

English-Czech Systems in WMT19: Document-Level Transformer

We describe our NMT systems submitted to the WMT19 shared task in Englis...
research
06/07/2021

Diverse Pretrained Context Encodings Improve Document Translation

We propose a new architecture for adapting a sentence-level sequence-to-...
research
10/03/2020

Multilevel Text Alignment with Cross-Document Attention

Text alignment finds application in tasks such as citation recommendatio...
