Form-NLU: Dataset for the Form Language Understanding

04/04/2023
by   Yihao Ding, et al.
0

Compared to general document analysis tasks, form document structure understanding and retrieval are challenging. Form documents are typically made by two types of authors; A form designer, who develops the form structure and keys, and a form user, who fills out form values based on the provided keys. Hence, the form values may not be aligned with the form designer's intention (structure and keys) if a form user gets confused. In this paper, we introduce Form-NLU, the first novel dataset for form structure understanding and its key and value information extraction, interpreting the form designer's intent and the alignment of user-written value on it. It consists of 857 form images, 6k form keys and values, and 4k table keys and values. Our dataset also includes three form types: digital, printed, and handwritten, which cover diverse form appearances and layouts. We propose a robust positional and logical relation-based form key-value information extraction framework. Using this dataset, Form-NLU, we first examine strong object detection models for the form layout understanding, then evaluate the key information extraction task on the dataset, providing fine-grained results for different types of forms and keys. Furthermore, we examine it with the off-the-shelf pdf layout extraction tool and prove its feasibility in real-world cases.

READ FULL TEXT
research
08/23/2022

Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural Networks

Geometric Deep Learning has recently attracted significant interest in a...
research
06/14/2022

RDU: A Region-based Approach to Form-style Document Understanding

Key Information Extraction (KIE) is aimed at extracting structured infor...
research
05/16/2023

DLUE: Benchmarking Document Language Understanding

Understanding documents is central to many real-world tasks but remains ...
research
08/08/2022

Simplifying Electronic Document Digital Signatures

Electronic documents are typically signed using private keys and the mat...
research
10/11/2020

Revising FUNSD dataset for key-value detection in document images

FUNSD is one of the limited publicly available datasets for information ...
research
06/24/2021

MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction

Visual Information Extraction (VIE) task aims to extract key information...
research
05/17/2023

Exploring the Space of Key-Value-Query Models with Intention

Attention-based models have been a key element of many recent breakthrou...

Please sign up or login with your details

Forgot password? Click here to reset