Semantic Parsing of Interpage Relations

05/26/2022
by   Mehmet Arif Demirtaş, et al.
0

Page-level analysis of documents has been a topic of interest in digitization efforts, and multimodal approaches have been applied to both classification and page stream segmentation. In this work, we focus on capturing finer semantic relations between pages of a multi-page document. To this end, we formalize the task as semantic parsing of interpage relations and we propose an end-to-end approach for interpage dependency extraction, inspired by the dependency parsing literature. We further design a multi-task training approach to jointly optimize for page embeddings to be used in segmentation, classification, and parsing of the page dependencies using textual and visual features extracted from the pages. Moreover, we also combine the features from two modalities to obtain multimodal page embeddings. To the best of our knowledge, this is the first study to extract rich semantic interpage relations from multi-page documents. Our experimental results show that the proposed method increased LAS by 41 percentage points for semantic parsing, increased accuracy by 33 percentage points for page stream segmentation, and 45 percentage points for page classification over a naive baseline.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/09/2017

Page Stream Segmentation with Convolutional Neural Nets Combining Textual and Visual Features

For digitization of paper files via OCR, preservation of document contex...
research
04/25/2017

Automatic Compositor Attribution in the First Folio of Shakespeare

Compositor attribution, the clustering of pages in a historical printed ...
research
07/02/2022

Sequence-aware multimodal page classification of Brazilian legal documents

The Brazilian Supreme Court receives tens of thousands of cases each sem...
research
11/24/2021

Handling tree-structured text: parsing directory pages

The determination of the reading sequence of text is fundamental to docu...
research
08/24/2023

Beyond Document Page Classification: Design, Datasets, and Challenges

This paper highlights the need to bring document classification benchmar...
research
08/04/2021

Multi-Round Parsing-based Multiword Rules for Scientific OpenIE

Information extraction (IE) in scientific literature has facilitated man...
research
12/12/2022

Page Layout Analysis of Text-heavy Historical Documents: a Comparison of Textual and Visual Approaches

Page layout analysis is a fundamental step in document processing which ...

Please sign up or login with your details

Forgot password? Click here to reset