Joint Layout Analysis, Character Detection and Recognition for Historical Document Digitization

07/14/2020
by   Weihong Ma, et al.
5

In this paper, we propose an end-to-end trainable framework for restoring historical documents content that follows the correct reading order. In this framework, two branches named character branch and layout branch are added behind the feature extraction network. The character branch localizes individual characters in a document image and recognizes them simultaneously. Then we adopt a post-processing method to group them into text lines. The layout branch based on fully convolutional network outputs a binary mask. We then use Hough transform for line detection on the binary mask and combine character results with the layout information to restore document content. These two branches can be trained in parallel and are easy to train. Furthermore, we propose a re-score mechanism to minimize recognition error. Experiment results on the extended Chinese historical document MTHv2 dataset demonstrate the effectiveness of the proposed framework.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

research
05/14/2019

A human-inspired recognition system for premodern Japanese historical documents

Recognition of historical documents is a challenging problem due to the ...
research
04/07/2021

Document Layout Analysis via Dynamic Residual Feature Fusion

The document layout analysis (DLA) aims to split the document image into...
research
08/26/2023

Bengali Document Layout Analysis with Detectron2

Document digitization is vital for preserving historical records, effici...
research
03/08/2019

ICDAR 2019 Historical Document Reading Challenge on Large Structured Chinese Family Records

We propose a Historical Document Reading Challenge on Large Chinese Stru...
research
10/22/2018

Baseline Detection in Historical Documents using Convolutional U-Nets

Baseline detection is still a challenging task for heterogeneous collect...
research
12/15/2019

Indiscapes: Instance Segmentation Networks for Layout Parsing of Historical Indic Manuscripts

Historical palm-leaf manuscript and early paper documents from Indian su...
research
03/25/2019

DeepCenterline: a Multi-task Fully Convolutional Network for Centerline Extraction

A novel centerline extraction framework is reported which combines an en...

Please sign up or login with your details

Forgot password? Click here to reset