Hi-Transformer: Hierarchical Interactive Transformer for Efficient and Effective Long Document Modeling

06/02/2021
by   Chuhan Wu, et al.
0

Transformer is important for text modeling. However, it has difficulty in handling long documents due to the quadratic complexity with input text length. In order to handle this problem, we propose a hierarchical interactive Transformer (Hi-Transformer) for efficient and effective long document modeling. Hi-Transformer models documents in a hierarchical way, i.e., first learns sentence representations and then learns document representations. It can effectively reduce the complexity and meanwhile capture global document context in the modeling of each sentence. More specifically, we first use a sentence Transformer to learn the representations of each sentence. Then we use a document Transformer to model the global document context from these sentence representations. Next, we use another sentence Transformer to enhance sentence modeling using the global document context. Finally, we use hierarchical pooling method to obtain document embedding. Extensive experiments on three benchmark datasets validate the efficiency and effectiveness of Hi-Transformer in long document modeling.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/09/2022

HEGEL: Hypergraph Transformer for Long Document Summarization

Extractive summarization for long documents is challenging due to the ex...
research
10/29/2019

Big Bidirectional Insertion Representations for Documents

The Insertion Transformer is well suited for long form text generation d...
research
11/20/2022

SeDR: Segment Representation Learning for Long Documents Dense Retrieval

Recently, Dense Retrieval (DR) has become a promising solution to docume...
research
11/08/2019

Question Generation from Paragraphs: A Tale of Two Hierarchical Models

Automatic question generation from paragraphs is an important and challe...
research
10/17/2022

Effective and Efficient Query-aware Snippet Extraction for Web Search

Query-aware webpage snippet extraction is widely used in search engines ...
research
08/31/2019

Humor Detection: A Transformer Gets the Last Laugh

Much previous work has been done in attempting to identify humor in text...
research
07/04/2022

Understanding Performance of Long-Document Ranking Models through Comprehensive Evaluation and Leaderboarding

We carry out a comprehensive evaluation of 13 recent models for ranking ...

Please sign up or login with your details

Forgot password? Click here to reset