Learning Document Image Binarization from Data

05/04/2015
by   Yue Wu, et al.
0

In this paper we present a fully trainable binarization solution for degraded document images. Unlike previous attempts that often used simple features with a series of pre- and post-processing, our solution encodes all heuristics about whether or not a pixel is foreground text into a high-dimensional feature vector and learns a more complicated decision function. In particular, we prepare features of three types: 1) existing features for binarization such as intensity [1], contrast [2], [3], and Laplacian [4], [5]; 2) reformulated features from existing binarization decision functions such those in [6] and [7]; and 3) our newly developed features, namely the Logarithm Intensity Percentile (LIP) and the Relative Darkness Index (RDI). Our initial experimental results show that using only selected samples (about 1.5 available training data), we can achieve a binarization performance comparable to those fine-tuned (typically by hand), state-of-the-art methods. Additionally, the trained document binarization classifier shows good generalization capabilities on out-of-domain data.

READ FULL TEXT

page 5

page 6

page 7

page 8

page 11

research
12/03/2021

The Influence of Data Pre-processing and Post-processing on Long Document Summarization

Long document summarization is an important and hard task in the field o...
research
10/14/2022

Text Detection Forgot About Document OCR

Detection and recognition of text from scans and other images, commonly ...
research
10/28/2022

Facial Action Unit Detection and Intensity Estimation from Self-supervised Representation

As a fine-grained and local expression behavior measurement, facial acti...
research
06/25/2020

Cascading Modular U-Nets for Document Image Binarization

In recent years, U-Net has achieved good results in various image proces...
research
02/28/2022

LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding

Structured document understanding has attracted considerable attention a...
research
05/16/2023

Leaf Only SAM: A Segment Anything Pipeline for Zero-Shot Automated Leaf Segmentation

Segment Anything Model (SAM) is a new foundation model that can be used ...
research
07/13/2019

Motion Segmentation Using Locally Affine Atom Voting

We present a novel method for motion segmentation called LAAV (Locally A...

Please sign up or login with your details

Forgot password? Click here to reset