Scene Text Image Super-Resolution via Content Perceptual Loss and Criss-Cross Transformer Blocks

10/13/2022
by Rui Qin, et al.

Text image super-resolution is a unique and important task that enhances the readability of text images for humans. It is widely used as a pre-processing step in scene text recognition. However, due to the complex degradation in natural scenes, recovering high-resolution text from low-resolution inputs is ambiguous and challenging. Existing methods mainly leverage deep neural networks trained with pixel-wise losses designed for natural image reconstruction, which ignore the characteristics unique to text. A few works have proposed content-based losses, but they focus only on the accuracy of text recognizers, so the reconstructed images may still be ambiguous to humans. Further, they often generalize poorly across languages. To address these issues, we present TATSR, a Text-Aware Text Super-Resolution framework, which effectively learns the unique characteristics of text using Criss-Cross Transformer Blocks (CCTBs) and a novel Content Perceptual (CP) Loss. The CCTB extracts vertical and horizontal content information from text images using two orthogonal transformers, one for each direction. The CP Loss supervises text reconstruction with content semantics through multi-scale text recognition features, which effectively incorporates content awareness into the framework. Extensive experiments on datasets in several languages demonstrate that TATSR outperforms state-of-the-art methods in terms of both recognition accuracy and human perception.
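
The abstract gives no implementation details for the CCTB, so the following is only a rough sketch of the general idea of attending along two orthogonal directions. It assumes a parameter-free dot-product self-attention (no learned projections, heads, or normalization), applied once along each row and once along each column of a feature map and summed with a residual connection. The names `attend` and `criss_cross` are hypothetical and are not taken from the paper.

```python
import math


def attend(seq):
    """Parameter-free dot-product self-attention over a list of feature vectors.

    Assumption: queries, keys, and values are all the raw features; a real
    transformer block would use learned projections.
    """
    dim = len(seq[0])
    scale = 1.0 / math.sqrt(dim)
    out = []
    for query in seq:
        scores = [scale * sum(a * b for a, b in zip(query, key)) for key in seq]
        top = max(scores)  # subtract the max for a numerically stable softmax
        weights = [math.exp(s - top) for s in scores]
        total = sum(weights)
        out.append([
            sum(w * value[c] for w, value in zip(weights, seq)) / total
            for c in range(dim)
        ])
    return out


def criss_cross(feat):
    """Combine horizontal and vertical attention over feat[y][x] (a feature vector).

    Returns feat + row-wise attention + column-wise attention (a residual sum,
    chosen here for illustration only).
    """
    height, width = len(feat), len(feat[0])
    # Horizontal branch: each row is attended independently.
    rows = [attend(feat[y]) for y in range(height)]
    # Vertical branch: each column is attended independently; indexed cols[x][y].
    cols = [attend([feat[y][x] for y in range(height)]) for x in range(width)]
    return [
        [
            [f + h + v for f, h, v in zip(feat[y][x], rows[y][x], cols[x][y])]
            for x in range(width)
        ]
        for y in range(height)
    ]
```

In this sketch each output position only sees features from its own row and its own column, which is one simple way to read "vertical and horizontal content information"; the actual block in the paper may differ substantially.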

research
09/16/2019

TextSR: Content-Aware Text Super-Resolution Guided by Recognition

Scene text recognition has witnessed rapid development with the advance ...
research
05/07/2020

Scene Text Image Super-Resolution in the Wild

Low-resolution text images are often seen in natural scenes such as docu...
research
04/29/2022

C3-STISR: Scene Text Image Super-resolution with Triple Clues

Scene text image super-resolution (STISR) has been regarded as an import...
research
11/25/2019

Cascaded Detail-Preserving Networks for Super-Resolution of Document Images

The accuracy of OCR is usually affected by the quality of the input docu...
research
07/31/2023

HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution

Scene text image super-resolution (STISR) is an important pre-processing...
research
09/03/2023

Orientation-Independent Chinese Text Recognition in Scene Images

Scene text recognition (STR) has attracted much attention due to its bro...
