GroupLink: An End-to-end Multitask Method for Word Grouping and Relation Extraction in Form Understanding

05/10/2021
by Zilong Wang, et al.

Forms are a common type of document in real life and carry rich information through both their textual content and their organizational structure. To process forms automatically, word grouping and relation extraction are two fundamental steps that follow preliminary optical character recognition (OCR). Word grouping aggregates words that belong to the same semantic entity, and relation extraction predicts the links between semantic entities. Existing works treat them as two separate tasks, but the two are correlated and can reinforce each other: grouping refines the integrated representation of each entity, and linking gives feedback that improves grouping. We therefore acquire multimodal features from both textual data and layout information and build an end-to-end model that combines word grouping and relation extraction through multitask training, to enhance performance on each task. We validate the proposed method on FUNSD, a real-world, fully annotated benchmark of noisy scanned forms, and extensive experiments demonstrate its effectiveness.
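
To make the two task definitions concrete, the following is a minimal sketch of what their outputs look like. It is not the authors' model: it assumes the pairwise "same entity" and word-level "link" predictions are already available, and the function names `group_words` and `link_entities`, as well as the sample form words, are hypothetical. Grouping is done here with a plain union-find, which is a stand-in for whatever the learned model produces.

```python
from typing import Dict, List, Set, Tuple


def group_words(num_words: int,
                same_entity_pairs: List[Tuple[int, int]]) -> List[List[int]]:
    """Word grouping: merge word indices into entities with union-find.

    `same_entity_pairs` holds (assumed, already predicted) pairs of word
    indices that belong to the same semantic entity.
    """
    parent = list(range(num_words))

    def find(i: int) -> int:
        # Follow parent pointers to the root, halving the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for a, b in same_entity_pairs:
        ra, rb = find(a), find(b)
        if ra != rb:
            # Keep the smaller index as root so each root is the first word.
            parent[max(ra, rb)] = min(ra, rb)

    groups: Dict[int, List[int]] = {}
    for i in range(num_words):
        groups.setdefault(find(i), []).append(i)
    # Entities are ordered by their first word index.
    return [groups[root] for root in sorted(groups)]


def link_entities(entities: List[List[int]],
                  word_links: List[Tuple[int, int]]) -> Set[Tuple[int, int]]:
    """Relation extraction: lift word-level links to entity-level links."""
    word_to_entity = {w: e for e, words in enumerate(entities) for w in words}
    links = {(word_to_entity[a], word_to_entity[b]) for a, b in word_links}
    # Drop links that fall inside a single entity.
    return {(src, dst) for src, dst in links if src != dst}


# Hypothetical OCR output for a small form.
words = ["Date", "of", "birth:", "05/10/1990", "Name:", "Jane", "Doe"]
entities = group_words(len(words), [(0, 1), (1, 2), (5, 6)])
relations = link_entities(entities, [(2, 3), (4, 5)])
```

In this toy input, "Date of birth:" and "Jane Doe" each become one entity, and the two resulting links connect each question entity to its answer entity. The example also hints at why the tasks interact: a wrong grouping changes which entities exist, and therefore which links can be expressed at all.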


research
03/10/2022

AIFB-WebScience at SemEval-2022 Task 12: Relation Extraction First – Using Relation Extraction to Identify Entities

In this paper, we present an end-to-end joint entity and relation extrac...
research
08/26/2018

Scientific Relation Extraction with Selectively Incorporated Concept Embeddings

This paper describes our submission for the SemEval 2018 Task 7 shared t...
research
05/24/2023

RE^2: Region-Aware Relation Extraction from Visually Rich Documents

Current research in form understanding predominantly relies on large pre...
research
07/09/2021

Form2Seq : A Framework for Higher-Order Form Structure Extraction

Document structure extraction has been a widely researched area for deca...
research
06/02/2021

End-to-End Hierarchical Relation Extraction for Generic Form Understanding

Form understanding is a challenging problem which aims to recognize sema...
research
11/11/2022

Unimodal and Multimodal Representation Training for Relation Extraction

Multimodal integration of text, layout and visual information has achiev...
research
06/09/2021

UniKeyphrase: A Unified Extraction and Generation Framework for Keyphrase Prediction

Keyphrase Prediction (KP) task aims at predicting several keyphrases tha...
