Spatial Dependency Parsing for 2D Document Understanding

05/01/2020
by   Wonseok Hwang, et al.
0

Information Extraction (IE) for document images is often approached as a BIO tagging problem, where the model sequentially goes through and classifies each recognized input token into one of the information categories. However, such problem setup has two inherent limitations that (1) it can only extract a flat list of information and (2) it assumes that the input data is serialized, often by a simple rule-based script. Nevertheless, real-world documents often contain hierarchical information in the form of two-dimensional language data in which the serialization can be highly non-trivial. To tackle these issues, we propose SPADE (SPatial DEpendency parser), an end-to-end spatial dependency parser that is serializer-free and capable of modeling an arbitrary number of information layers, making it suitable for parsing structure-rich documents such as receipts and multimodal documents such as name cards. We show that SPADE outperforms the previous BIO tagging-based approach on name card parsing task and achieves comparable performance on receipt parsing task. Especially, when the receipt images have non-flat manifold representing physical distortion of receipt paper in real-world, SPADE outperforms the tagging-based method by a large margin of 25.8 the strong performance of SPADE over spatially complex document.

READ FULL TEXT

page 6

page 8

research
07/11/2018

An improved neural network model for joint POS tagging and dependency parsing

We propose a novel neural network model for joint part-of-speech (POS) t...
research
09/26/2017

Object-oriented Neural Programming (OONP) for Document Understanding

We propose Object-oriented Neural Programming (OONP), a framework for se...
research
04/20/2017

Neural End-to-End Learning for Computational Argumentation Mining

We investigate neural techniques for end-to-end computational argumentat...
research
12/24/2020

ThamizhiUDp: A Dependency Parser for Tamil

This paper describes how we developed a neural-based dependency parser, ...
research
09/02/2018

Neural Ranking Models for Temporal Dependency Structure Parsing

We design and build the first neural temporal dependency parser. It util...
research
12/01/2022

PIZZA: A new benchmark for complex end-to-end task-oriented parsing

Much recent work in task-oriented parsing has focused on finding a middl...

Please sign up or login with your details

Forgot password? Click here to reset