ESTISR: Adapting Efficient Scene Text Image Super-resolution for Real-Scenes

06/04/2023
by Minghao Fu, et al.

While scene text image super-resolution (STISR) has yielded remarkable improvements in scene text recognition (STR) accuracy, prior methods have focused on optimizing performance and paid little attention to efficiency, a crucial factor in deploying the STISR-STR pipeline. In this work, we propose a novel Efficient Scene Text Image Super-resolution (ESTISR) network for resource-limited deployment platforms. ESTISR relies on two critical components: a CNN-based feature extractor and an efficient self-attention mechanism used for decoding low-resolution images. For the feature extractor, we design a re-parameterized inverted residual block suited to resource-limited settings. For the attention mechanism, we propose softmax shrinking, a novel kernel-based form of self-attention that offers linear complexity while naturally incorporating discriminative low-level features into the self-attention structure. Extensive experiments on TextZoom show that ESTISR retains high image restoration quality and improves STR accuracy on low-resolution images. Furthermore, ESTISR consistently outperforms current methods in actual running time and peak memory consumption, achieving a better trade-off between performance and efficiency.
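The abstract does not spell out how softmax shrinking works, so the sketch below is not the authors' method. It only illustrates, under stated assumptions, why a generic kernel-based self-attention can run in linear rather than quadratic time in the number of tokens n. The feature map (elu(x) + 1), the function names, and the tensor shapes are all assumptions chosen for illustration. The key step is reordering the matrix products so that the n-by-n score matrix is never formed.

```python
import numpy as np


def feature_map(x):
    # Assumed positive feature map: elu(x) + 1. This is a common choice in
    # kernel-based attention, not necessarily the one used by ESTISR.
    return np.where(x > 0, x + 1.0, np.exp(np.minimum(x, 0.0)))


def linear_attention(q, k, v):
    """Kernel-based attention in O(n * d * d_v) time, without an (n, n) matrix.

    q, k: arrays of shape (n, d); v: array of shape (n, d_v).
    """
    phi_q = feature_map(q)           # (n, d)
    phi_k = feature_map(k)           # (n, d)
    kv = phi_k.T @ v                 # (d, d_v): keys and values summarized once
    z = phi_k.sum(axis=0)            # (d,): normalizer summary
    numerator = phi_q @ kv           # (n, d_v)
    denominator = phi_q @ z          # (n,), strictly positive by construction
    return numerator / denominator[:, None]


def quadratic_reference(q, k, v):
    """Same attention computed the O(n^2) way, for comparison only."""
    phi_q, phi_k = feature_map(q), feature_map(k)
    scores = phi_q @ phi_k.T                        # (n, n) similarity matrix
    weights = scores / scores.sum(axis=1, keepdims=True)
    return weights @ v


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    q = rng.normal(size=(8, 4))
    k = rng.normal(size=(8, 4))
    v = rng.normal(size=(8, 3))
    fast = linear_attention(q, k, v)
    slow = quadratic_reference(q, k, v)
    print("outputs match:", np.allclose(fast, slow))
```

Both functions compute the same result. The linear version simply uses associativity to multiply the keys and values first, which is what makes the cost grow linearly with the number of tokens.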


