A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution

03/17/2022
by   Jianqi Ma, et al.
0

Scene text image super-resolution aims to increase the resolution and readability of the text in low-resolution images. Though significant improvement has been achieved by deep convolutional neural networks (CNNs), it remains difficult to reconstruct high-resolution images for spatially deformed texts, especially rotated and curve-shaped ones. This is because the current CNN-based methods adopt locality-based operations, which are not effective to deal with the variation caused by deformations. In this paper, we propose a CNN based Text ATTention network (TATT) to address this problem. The semantics of the text are firstly extracted by a text recognition module as text prior information. Then we design a novel transformer-based module, which leverages global attention mechanism, to exert the semantic guidance of text prior to the text reconstruction process. In addition, we propose a text structure consistency loss to refine the visual appearance by imposing structural consistency on the reconstructions of regular and deformed texts. Experiments on the benchmark TextZoom dataset show that the proposed TATT not only achieves state-of-the-art performance in terms of PSNR/SSIM metrics, but also significantly improves the recognition accuracy in the downstream text recognition task, particularly for text instances with multi-orientation and curved shapes. Code is available at https://github.com/mjq11302010044/TATT.

READ FULL TEXT

page 1

page 6

page 8

research
06/29/2021

Text Prior Guided Scene Text Image Super-resolution

Scene text image super-resolution (STISR) aims to improve the resolution...
research
02/21/2023

Improving Scene Text Image Super-Resolution via Dual Prior Modulation Network

Scene text image super-resolution (STISR) aims to simultaneously increas...
research
12/13/2021

Text Gestalt: Stroke-Aware Scene Text Image Super-Resolution

In the last decade, the blossom of deep learning has witnessed the rapid...
research
05/09/2023

TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition

Text irregularities pose significant challenges to scene text recognizer...
research
04/29/2022

C3-STISR: Scene Text Image Super-resolution with Triple Clues

Scene text image super-resolution (STISR) has been regarded as an import...
research
06/04/2023

ESTISR: Adapting Efficient Scene Text Image Super-resolution for Real-Scenes

While scene text image super-resolution (STISR) has yielded remarkable i...
research
11/22/2018

Mask R-CNN with Pyramid Attention Network for Scene Text Detection

In this paper, we present a new Mask R-CNN based text detection approach...

Please sign up or login with your details

Forgot password? Click here to reset