DUET: Detection Utilizing Enhancement for Text in Scanned or Captured Documents

06/10/2021
by   Eun-Soo Jung, et al.
0

We present a novel deep neural model for text detection in document images. For robust text detection in noisy scanned documents, the advantages of multi-task learning are adopted by adding an auxiliary task of text enhancement. Namely, our proposed model is designed to perform noise reduction and text region enhancement as well as text detection. Moreover, we enrich the training data for the model with synthesized document images that are fully labeled for text detection and enhancement, thus overcome the insufficiency of labeled document image data. For the effective exploitation of the synthetic and real data, the training process is separated in two phases. The first phase is training only synthetic data in a fully-supervised manner. Then real data with only detection labels are added in the second phase. The enhancement task for the real data is weakly-supervised with information from their detection labels. Our methods are demonstrated in a real document dataset with performances exceeding those of other text detection methods. Moreover, ablations are conducted and the results confirm the effectiveness of the synthetic data, auxiliary task, and weak-supervision. Whereas the existing text detection studies mostly focus on the text in scenes, our proposed method is optimized to the applications for the text in scanned documents.

READ FULL TEXT

page 1

page 3

page 4

page 5

research
01/13/2022

Weakly Supervised Scene Text Detection using Deep Reinforcement Learning

The challenging field of scene text detection requires complex data anno...
research
09/02/2018

Weakly-Supervised Neural Text Classification

Deep neural networks are gaining increasing popularity for the classic t...
research
10/02/2010

A Microwave Imaging and Enhancement Technique from Noisy Synthetic Data

An inverse iterative algorithm for microwave imaging based on moment met...
research
02/11/2022

Towards Weakly-Supervised Text Spotting using a Multi-Task Transformer

Text spotting end-to-end methods have recently gained attention in the l...
research
07/17/2017

Aesthetic-Driven Image Enhancement by Adversarial Learning

We introduce EnhanceGAN, an adversarial learning based model that perfor...
research
08/16/2023

Detecting Olives with Synthetic or Real Data? Olive the Above

Modern robotics has enabled the advancement in yield estimation for prec...
research
02/10/2023

Semi-supervised Large-scale Fiber Detection in Material Images with Synthetic Data

Accurate detection of large-scale, elliptical-shape fibers, including th...

Please sign up or login with your details

Forgot password? Click here to reset