GDB: Gated convolutions-based Document Binarization

02/04/2023
by   Zongyuan Yang, et al.
0

Document binarization is a key pre-processing step for many document analysis tasks. However, existing methods can not extract stroke edges finely, mainly due to the fair-treatment nature of vanilla convolutions and the extraction of stroke edges without adequate supervision by boundary-related information. In this paper, we formulate text extraction as the learning of gating values and propose an end-to-end gated convolutions-based network (GDB) to solve the problem of imprecise stroke edge extraction. The gated convolutions are applied to selectively extract the features of strokes with different attention. Our proposed framework consists of two stages. Firstly, a coarse sub-network with an extra edge branch is trained to get more precise feature maps by feeding a priori mask and edge. Secondly, a refinement sub-network is cascaded to refine the output of the first stage by gated convolutions based on the sharp edge. For global information, GDB also contains a multi-scale operation to combine local and global features. We conduct comprehensive experiments on ten Document Image Binarization Contest (DIBCO) datasets from 2009 to 2019. Experimental results show that our proposed methods outperform the state-of-the-art methods in terms of all metrics on average and achieve top ranking on six benchmark datasets.

READ FULL TEXT

page 1

page 4

page 9

research
08/21/2021

Multi-scale Edge-based U-shape Network for Salient Object Detection

Deep-learning based salient object detection methods achieve great impro...
research
02/08/2022

Disentangle Saliency Detection into Cascaded Detail Modeling and Body Filling

Salient object detection has been long studied to identify the most visu...
research
03/01/2019

Pyramid Feature Selective Network for Saliency detection

Saliency detection is one of the basic challenges in computer vision. Ho...
research
12/08/2021

GCA-Net : Utilizing Gated Context Attention for Improving Image Forgery Localization and Detection

Forensic analysis depends on the identification of hidden traces from ma...
research
04/07/2019

A Dilated Inception Network for Visual Saliency Prediction

Recently, with the advent of deep convolutional neural networks (DCNN), ...
research
12/02/2022

Dunhuang murals contour generation network based on convolution and self-attention fusion

Dunhuang murals are a collection of Chinese style and national style, fo...
research
11/06/2021

Convolutional Gated MLP: Combining Convolutions gMLP

To the best of our knowledge, this is the first paper to introduce Convo...

Please sign up or login with your details

Forgot password? Click here to reset