A selectional auto-encoder approach for document image binarization

06/30/2017
by   Jorge Calvo-Zaragoza, et al.
0

Binarization plays a key role in the automatic information retrieval from document images. This process is usually performed in the first stages of documents analysis systems, and serves as a basis for subsequent steps. Hence it has to be robust in order to allow the full analysis workflow to be successful. Several methods for document image binarization have been proposed so far, most of which are based on hand-crafted image processing strategies. Recently, Convolutional Neural Networks have shown an amazing performance in many disparate duties related to computer vision. In this paper we discuss the use of convolutional auto-encoders devoted to learning an end-to-end map from an input image to its selectional output, in which activations indicate the likelihood of pixels to be either foreground or background. Once trained, documents can therefore be binarized by parsing them through the model and applying a threshold. This approach has proven to outperform existing binarization strategies in a number of document domains.

READ FULL TEXT
research
02/23/2021

Neural Ranking Models for Document Retrieval

Ranking models are the main components of information retrieval systems....
research
10/03/2022

EraseNet: A Recurrent Residual Network for Supervised Document Cleaning

Document denoising is considered one of the most challenging tasks in co...
research
09/09/2017

Image Processing Operations Identification via Convolutional Neural Network

In recent years, image forensics has attracted more and more attention, ...
research
02/25/2015

Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval

This paper presents a new state-of-the-art for document image classifica...
research
08/09/2017

Anveshak - A Groundtruth Generation Tool for Foreground Regions of Document Images

We propose a graphical user interface based groundtruth generation tool ...
research
12/12/2021

Magnifying Networks for Images with Billions of Pixels

The shift towards end-to-end deep learning has brought unprecedented adv...
research
02/17/2019

Multiple Document Representations from News Alerts for Automated Bio-surveillance Event Detection

Due to globalization, geographic boundaries no longer serve as effective...

Please sign up or login with your details

Forgot password? Click here to reset