DocDeshadower: Frequency-aware Transformer for Document Shadow Removal

07/28/2023
by   Shenghong Luo, et al.
0

The presence of shadows significantly impacts the visual quality of scanned documents. However, the existing traditional techniques and deep learning methods used for shadow removal have several limitations. These methods either rely heavily on heuristics, resulting in suboptimal performance, or require large datasets to learn shadow-related features. In this study, we propose the DocDeshadower, a multi-frequency Transformer-based model built on Laplacian Pyramid. DocDeshadower is designed to remove shadows at different frequencies in a coarse-to-fine manner. To achieve this, we decompose the shadow image into different frequency bands using Laplacian Pyramid. In addition, we introduce two novel components to this model: the Attention-Aggregation Network and the Gated Multi-scale Fusion Transformer. The Attention-Aggregation Network is designed to remove shadows in the low-frequency part of the image, whereas the Gated Multi-scale Fusion Transformer refines the entire image at a global scale with its large perceptive field. Our extensive experiments demonstrate that DocDeshadower outperforms the current state-of-the-art methods in both qualitative and quantitative terms.

READ FULL TEXT

page 1

page 3

page 6

page 7

research
08/26/2023

Devignet: High-Resolution Vignetting Removal via a Dual Aggregated Fusion Transformer With Adaptive Channel Expansion

Vignetting commonly occurs as a degradation in images resulting from fac...
research
09/13/2023

ShaDocFormer: A Shadow-attentive Threshold Detector with Cascaded Fusion Refiner for document shadow removal

Document shadow is a common issue that arise when capturing documents us...
research
11/30/2022

ShaDocNet: Learning Spatial-Aware Tokens in Transformer for Document Shadow Removal

Shadow removal improves the visual quality and legibility of digital cop...
research
11/21/2018

Gated Context Aggregation Network for Image Dehazing and Deraining

Image dehazing aims to recover the uncorrupted content from a hazy image...
research
11/10/2021

Multi-Scale Single Image Dehazing Using Laplacian and Gaussian Pyramids

Model driven single image dehazing was widely studied on top of differen...
research
11/12/2022

MSLKANet: A Multi-Scale Large Kernel Attention Network for Scene Text Removal

Scene text removal aims to remove the text and fill the regions with per...
research
11/30/2022

Two-branch Multi-scale Deep Neural Network for Generalized Document Recapture Attack Detection

The image recapture attack is an effective image manipulation method to ...

Please sign up or login with your details

Forgot password? Click here to reset