ROPE: Reading Order Equivariant Positional Encoding for Graph-based Document Information Extraction

06/21/2021
by Chen-Yu Lee, et al.

The natural reading order of words is crucial for information extraction from form-like documents. Despite recent advances in Graph Convolutional Networks (GCNs) for modeling the spatial layout patterns of documents, they have limited ability to capture the reading order of the word-level node representations in a graph. We propose Reading Order Equivariant Positional Encoding (ROPE), a new positional encoding technique designed to capture the sequential presentation of words in documents. Given a word-level graph connectivity, ROPE generates unique reading order codes for neighboring words relative to the target word. We study two fundamental document entity extraction tasks, word labeling and word grouping, on the public FUNSD dataset and a large-scale payment dataset. We show that ROPE consistently improves existing GCNs with a margin up to 8.4
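The idea of reading order codes assigned to neighbors relative to a target word can be illustrated with a small sketch. This is not the authors' implementation: the function name `reading_order_codes`, the `reading_order` dictionary of global word indices (for example, from an OCR engine), and the choice of a signed rank as the code are all assumptions made for illustration only.

```python
def reading_order_codes(target, neighbors, reading_order):
    """Return {neighbor: code} for one target word (illustrative sketch).

    Assumption: the code is the neighbor's signed rank relative to the
    target after sorting the neighborhood by global reading order.
    """
    # Sort the target together with its graph neighbors by reading order.
    members = sorted(set(neighbors) | {target}, key=lambda w: reading_order[w])
    pivot = members.index(target)
    # Negative codes precede the target in reading order, positive follow it.
    return {w: i - pivot for i, w in enumerate(members) if w != target}


# Hypothetical toy document: global reading-order index for each word.
reading_order = {"Invoice": 0, "No": 1, "12345": 2, "Date": 3}
codes = reading_order_codes("No", ["12345", "Invoice", "Date"], reading_order)

# Shifting every global index by a constant leaves the codes unchanged,
# because only the relative order within the neighborhood matters.
shifted = {w: i + 10 for w, i in reading_order.items()}
shifted_codes = reading_order_codes("No", ["12345", "Invoice", "Date"], shifted)
```

In this sketch the codes depend only on the ordering inside the neighborhood, so they are unique per neighbor and insensitive to where the neighborhood sits in the overall document sequence.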


research
08/26/2021

LayoutReader: Pre-training of Text and Layout for Reading Order Detection

Reading order detection is the cornerstone to understanding visually-ric...
research
05/19/2019

DivGraphPointer: A Graph Pointer Network for Extracting Diverse Keyphrases

Keyphrase extraction from documents is useful to a variety of applicatio...
research
04/08/2023

Word-level Persian Lipreading Dataset

Lip-reading has made impressive progress in recent years, driven by adva...
research
05/17/2022

Multidisciplinary Reading Patterns of Digital Documents

Reading plays a vital role in updating the researchers on recent develop...
research
08/29/2018

Question Answering by Reasoning Across Documents with Graph Convolutional Networks

Most research in reading comprehension has focused on answering question...
research
03/16/2022

FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction

Sequence modeling has demonstrated state-of-the-art performance on natur...
research
05/04/2023

Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation

Text reading order is a crucial aspect in the output of an OCR engine, w...