PARAGRAPH2GRAPH: A GNN-based framework for layout paragraph analysis

04/24/2023
by   Shu Wei, et al.
0

Document layout analysis has a wide range of requirements across various domains, languages, and business scenarios. However, most current state-of-the-art algorithms are language-dependent, with architectures that rely on transformer encoders or language-specific text encoders, such as BERT, for feature extraction. These approaches are limited in their ability to handle very long documents due to input sequence length constraints and are closely tied to language-specific tokenizers. Additionally, training a cross-language text encoder can be challenging due to the lack of labeled multilingual document datasets that consider privacy. Furthermore, some layout tasks require a clean separation between different layout components without overlap, which can be difficult for image segmentation-based algorithms to achieve. In this paper, we present Paragraph2Graph, a language-independent graph neural network (GNN)-based model that achieves competitive results on common document layout datasets while being adaptable to business scenarios with strict separation. With only 19.95 million parameters, our model is suitable for industrial applications, particularly in multi-language scenarios.

READ FULL TEXT

page 4

page 14

research
06/22/2018

Multi-Task Handwritten Document Layout Analysis

Document Layout Analysis is a fundamental step in Handwritten Text Proce...
research
01/27/2022

DocSegTr: An Instance-Level End-to-End Document Image Segmentation Transformer

Understanding documents with rich layouts is an essential step towards i...
research
08/03/2023

A Graphical Approach to Document Layout Analysis

Document layout analysis (DLA) is the task of detecting the distinct, se...
research
02/18/2021

Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer

We address the challenging problem of Natural Language Comprehension bey...
research
04/12/2022

Neural Graph Matching for Modification Similarity Applied to Electronic Document Comparison

In this paper, we present a novel neural graph matching approach applied...

Please sign up or login with your details

Forgot password? Click here to reset