High-Resolution Document Shadow Removal via A Large-Scale Real-World Dataset and A Frequency-Aware Shadow Erasing Net

08/27/2023
by   Zinuo Li, et al.
0

Shadows often occur when we capture the documents with casual equipment, which influences the visual quality and readability of the digital copies. Different from the algorithms for natural shadow removal, the algorithms in document shadow removal need to preserve the details of fonts and figures in high-resolution input. Previous works ignore this problem and remove the shadows via approximate attention and small datasets, which might not work in real-world situations. We handle high-resolution document shadow removal directly via a larger-scale real-world dataset and a carefully designed frequency-aware network. As for the dataset, we acquire over 7k couples of high-resolution (2462 x 3699) images of real-world document pairs with various samples under different lighting circumstances, which is 10 times larger than existing datasets. As for the design of the network, we decouple the high-resolution images in the frequency domain, where the low-frequency details and high-frequency boundaries can be effectively learned via the carefully designed network structure. Powered by our network and dataset, the proposed method clearly shows a better performance than previous methods in terms of visual quality and numerical results. The code, models, and dataset are available at: https://github.com/CXH-Research/DocShadow-SD7K

READ FULL TEXT

page 1

page 3

page 4

page 5

page 6

page 8

page 12

page 13

research
08/26/2023

Devignet: High-Resolution Vignetting Removal via a Dual Aggregated Fusion Transformer With Adaptive Channel Expansion

Vignetting commonly occurs as a degradation in images resulting from fac...
research
03/22/2023

LP-IOANet: Efficient High Resolution Document Shadow Removal

Document shadow removal is an integral task in document enhancement pipe...
research
08/31/2022

Injecting Image Details into CLIP's Feature Space

Although CLIP-like Visual Language Models provide a functional joint fea...
research
04/27/2022

HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic Segmentation

Unsupervised domain adaptation (UDA) aims to adapt a model trained on th...
research
04/01/2023

Automatic High Resolution Wire Segmentation and Removal

Wires and powerlines are common visual distractions that often undermine...
research
11/17/2022

AligNeRF: High-Fidelity Neural Radiance Fields via Alignment-Aware Training

Neural Radiance Fields (NeRFs) are a powerful representation for modelin...
research
06/09/2023

DocAligner: Annotating Real-world Photographic Document Images by Simply Taking Pictures

Recently, there has been a growing interest in research concerning docum...

Please sign up or login with your details

Forgot password? Click here to reset