Language Independent Neuro-Symbolic Semantic Parsing for Form Understanding

05/08/2023
by   Bhanu Prakash Voutharoja, et al.
0

Recent works on form understanding mostly employ multimodal transformers or large-scale pre-trained language models. These models need ample data for pre-training. In contrast, humans can usually identify key-value pairings from a form only by looking at layouts, even if they don't comprehend the language used. No prior research has been conducted to investigate how helpful layout information alone is for form understanding. Hence, we propose a unique entity-relation graph parsing method for scanned forms called LAGNN, a language-independent Graph Neural Network model. Our model parses a form into a word-relation graph in order to identify entities and relations jointly and reduce the time complexity of inference. This graph is then transformed by deterministic rules into a fully connected entity-relation graph. Our model simply takes into account relative spacing between bounding boxes from layout information to facilitate easy transfer across languages. To further improve the performance of LAGNN, and achieve isomorphism between entity-relation graphs and word-relation graphs, we use integer linear programming (ILP) based inference. Code is publicly available at https://github.com/Bhanu068/LAGNN

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/28/2022

LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding

Structured document understanding has attracted considerable attention a...
research
12/30/2020

ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning

Pre-trained Language Models (PLMs) have shown strong performance in vari...
research
05/24/2023

RE^2: Region-Aware Relation Extraction from Visually Rich Documents

Current research in form understanding predominantly relies on large pre...
research
09/29/2022

TERMinator: A system for scientific texts processing

This paper is devoted to the extraction of entities and semantic relatio...
research
09/05/2022

RLIP: Relational Language-Image Pre-training for Human-Object Interaction Detection

The task of Human-Object Interaction (HOI) detection targets fine-graine...
research
05/24/2023

Towards Few-shot Entity Recognition in Document Images: A Graph Neural Network Approach Robust to Image Manipulation

Recent advances of incorporating layout information, typically bounding ...
research
05/19/2022

Plane Geometry Diagram Parsing

Geometry diagram parsing plays a key role in geometry problem solving, w...

Please sign up or login with your details

Forgot password? Click here to reset