Including Keyword Position in Image-based Models for Act Segmentation of Historical Registers

09/17/2021
by   Mélodie Boillet, et al.

The segmentation of complex images into semantic regions has attracted growing interest in recent years with the advent of Deep Learning. Until recently, most existing methods for Historical Document Analysis focused on the visual appearance of documents, ignoring the rich information that textual content can offer. However, segmenting complex documents into semantic regions is sometimes impossible when relying only on visual features, and recent models embed both visual and textual information. In this paper, we focus on the use of both visual and textual information for segmenting historical registers into structured and meaningful units such as acts. An act is a textual record containing valuable knowledge such as demographic information (baptism, marriage or death) or royal decisions (donation or pardon). We propose a simple pipeline to enrich document images with the position of text lines containing key-phrases and show that running a standard image-based layout analysis system on these images can lead to significant gains. Our experiments show that the detection of acts increases from 38 [...] information, in real use-case conditions where text line positions and content are extracted with an automatic recognition system.
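The enrichment step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how key-phrase positions are encoded in the image, so everything below is an assumption made for illustration: the `TextLine` type, the `enrich_image` function, the case-insensitive substring match, the filled-rectangle encoding, and the example key-phrase are all hypothetical, and the image is a plain nested list of RGB tuples rather than a real scan.

```python
from dataclasses import dataclass


@dataclass
class TextLine:
    """A recognized text line (hypothetical structure, not from the paper)."""
    text: str
    box: tuple  # (x0, y0, x1, y1) in pixels; x1 and y1 are exclusive


def contains_key_phrase(text, key_phrases):
    """Case-insensitive substring match; an assumed matching rule."""
    lowered = text.lower()
    return any(phrase.lower() in lowered for phrase in key_phrases)


def enrich_image(image, lines, key_phrases, color=(255, 0, 0)):
    """Return a copy of `image` where the boxes of text lines containing
    a key-phrase are filled with `color`. The input image is left intact."""
    height = len(image)
    width = len(image[0]) if height else 0
    enriched = [row[:] for row in image]  # copy each row
    for line in lines:
        if not contains_key_phrase(line.text, key_phrases):
            continue
        x0, y0, x1, y1 = line.box
        # Clip the box to the image bounds before painting.
        for y in range(max(0, y0), min(height, y1)):
            for x in range(max(0, x0), min(width, x1)):
                enriched[y][x] = color
    return enriched


# Toy 8x6 white "page" with two recognized lines (made-up content).
page = [[(255, 255, 255)] * 8 for _ in range(6)]
lines = [
    TextLine("Le dix mars a été baptisé Jean", (1, 1, 7, 2)),
    TextLine("fils de Pierre et de Marie", (1, 3, 7, 4)),
]
enriched = enrich_image(page, lines, key_phrases=["a été baptisé"])
```

The enriched image could then be passed to any image-based layout analysis model in place of the original page. In a real setting, the line boxes and transcriptions would come from an automatic recognition system, as the abstract notes.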

