Modelling the semantics of text in complex document layouts using graph transformer networks

02/18/2022
by Thomas Roland Barillot, et al.

Representing structured text from complex documents typically requires a different machine learning technique for each content type, such as language models for paragraphs and convolutional neural networks (CNNs) for table extraction, which makes it difficult to draw links between text spans from different content types. In this article we propose a model that approximates the human reading pattern of a document and outputs a unique semantic representation for every text span, irrespective of the content type in which it is found. We base our architecture on a graph representation of the structured text, and we demonstrate not only that we can retrieve semantically similar information across documents but also that the embedding space we generate captures useful semantic information, similar to language models that work only on text sequences.
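To make the idea of a graph representation of structured text concrete, the following is a minimal sketch and not the authors' implementation. It assumes that nodes are text spans tagged with a content type and that edges connect spans a reader would plausibly visit together. The class `SpanGraph`, its methods, the edge choices, and the sample spans are all hypothetical and exist only for illustration; the abstract does not specify how the graph is built.

```python
from dataclasses import dataclass, field


@dataclass
class SpanGraph:
    """Toy document graph (hypothetical): nodes are text spans, edges are undirected links."""
    spans: list = field(default_factory=list)  # one (text, content_type) tuple per node
    edges: set = field(default_factory=set)    # undirected edges stored as (low_id, high_id)

    def add_span(self, text, content_type):
        """Add a text span as a node and return its integer id."""
        self.spans.append((text, content_type))
        return len(self.spans) - 1

    def link(self, a, b):
        """Connect two spans; the order of a and b does not matter."""
        self.edges.add((min(a, b), max(a, b)))

    def neighbours(self, node):
        """Return the sorted ids of all spans directly linked to `node`."""
        return sorted(b if a == node else a for a, b in self.edges if node in (a, b))


def build_example():
    """Build a three-node graph mixing a paragraph and two table spans (made-up text)."""
    g = SpanGraph()
    para = g.add_span("Revenue grew in 2021.", "paragraph")
    header = g.add_span("Revenue", "table_header")
    cell = g.add_span("4.2M", "table_cell")
    g.link(para, header)  # assumed edge: paragraph followed by the table in reading order
    g.link(header, cell)  # assumed edge: header and the cell beneath it
    return g


if __name__ == "__main__":
    graph = build_example()
    print(graph.neighbours(1))
```

In a sketch like this, paragraph spans and table spans live in the same structure, so a downstream model could in principle compute one embedding per node regardless of content type. How the proposed model actually builds and encodes its graph is described in the full paper, not here.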

research
09/16/2023

PDFTriage: Question Answering over Long, Structured Documents

Large Language Models (LLMs) have issues with document question answerin...
research
10/20/2021

Contrastive Document Representation Learning with Graph Attention Networks

Recent progress in pretrained Transformer-based language models has show...
research
03/29/2022

The Inefficiency of Language Models in Scholarly Retrieval: An Experimental Walk-through

Language models are increasingly becoming popular in AI-powered scientif...
research
05/05/2023

Expository Text Generation: Imitate, Retrieve, Paraphrase

Expository documents are vital resources for conveying complex informati...
research
01/11/2022

Structure with Semantics: Exploiting Document Relations for Retrieval

Retrieving relevant documents from a corpus is typically based on the se...
research
12/18/2017

An anthropological account of the Vim text editor: features and tweaks after 10 years of usage

The Vim text editor is very rich in capabilities and thus complex. This ...
research
09/03/2017

Understanding the Logical and Semantic Structure of Large Documents

Current language understanding approaches focus on small documents, such...
