MSLKANet: A Multi-Scale Large Kernel Attention Network for Scene Text Removal

11/12/2022
by   Guangtao Lyu, et al.
0

Scene text removal aims to remove the text and fill the regions with perceptually plausible background information in natural images. It has attracted increasing attention due to its various applications in privacy protection, scene text retrieval, and text editing. With the development of deep learning, the previous methods have achieved significant improvements. However, most of the existing methods seem to ignore the large perceptive fields and global information. The pioneer method can get significant improvements by only changing training data from the cropped image to the full image. In this paper, we present a single-stage multi-scale network MSLKANet for scene text removal in full images. For obtaining large perceptive fields and global information, we propose multi-scale large kernel attention (MSLKA) to obtain long-range dependencies between the text regions and the backgrounds at various granularity levels. Furthermore, we combine the large kernel decomposition mechanism and atrous spatial pyramid pooling to build a large kernel spatial pyramid pooling (LKSPP), which can perceive more valid pixels in the spatial dimension while maintaining large receptive fields and low cost of computation. Extensive experimental results indicate that the proposed method achieves state-of-the-art performance on both synthetic and real-world datasets and the effectiveness of the proposed components MSLKA and LKSPP.

READ FULL TEXT

page 5

page 6

research
09/01/2023

Selective Scene Text Removal

Scene text removal (STR) is the image transformation task to remove text...
research
11/25/2022

Aggregated Text Transformer for Scene Text Detection

This paper explores the multi-scale aggregation strategy for scene text ...
research
12/05/2022

Exploring Stroke-Level Modifications for Scene Text Editing

Scene text editing (STE) aims to replace text with the desired one while...
research
11/19/2020

Scene text removal via cascaded text stroke detection and erasing

Recent learning-based approaches show promising performance improvement ...
research
07/28/2023

DocDeshadower: Frequency-aware Transformer for Document Shadow Removal

The presence of shadows significantly impacts the visual quality of scan...
research
03/11/2019

MTRNet: A Generic Scene Text Eraser

Text removal algorithms have been proposed for uni-lingual scripts with ...
research
11/22/2018

Mask R-CNN with Pyramid Attention Network for Scene Text Detection

In this paper, we present a new Mask R-CNN based text detection approach...

Please sign up or login with your details

Forgot password? Click here to reset