Improving Scene Text Image Super-Resolution via Dual Prior Modulation Network

02/21/2023
by   Shipeng Zhu, et al.
0

Scene text image super-resolution (STISR) aims to simultaneously increase the resolution and legibility of the text images, and the resulting images will significantly affect the performance of downstream tasks. Although numerous progress has been made, existing approaches raise two crucial issues: (1) They neglect the global structure of the text, which bounds the semantic determinism of the scene text. (2) The priors, e.g., text prior or stroke prior, employed in existing works, are extracted from pre-trained text recognizers. That said, such priors suffer from the domain gap including low resolution and blurriness caused by poor imaging conditions, leading to incorrect guidance. Our work addresses these gaps and proposes a plug-and-play module dubbed Dual Prior Modulation Network (DPMN), which leverages dual image-level priors to bring performance gain over existing approaches. Specifically, two types of prior-guided refinement modules, each using the text mask or graphic recognition result of the low-quality SR image from the preceding layer, are designed to improve the structural clarity and semantic accuracy of the text, respectively. The following attention mechanism hence modulates two quality-enhanced images to attain a superior SR result. Extensive experiments validate that our method improves the image quality and boosts the performance of downstream tasks over five typical approaches on the benchmark. Substantial visualizations and ablation studies demonstrate the advantages of the proposed DPMN. Code is available at: https://github.com/jdfxzzy/DPMN.

READ FULL TEXT

page 1

page 2

page 3

page 7

research
03/17/2022

A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution

Scene text image super-resolution aims to increase the resolution and re...
research
06/29/2021

Text Prior Guided Scene Text Image Super-resolution

Scene text image super-resolution (STISR) aims to improve the resolution...
research
08/13/2023

TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image Super-Resolution

The goal of scene text image super-resolution is to reconstruct high-res...
research
08/14/2022

Global Priors Guided Modulation Network for Joint Super-Resolution and Inverse Tone-Mapping

Joint super-resolution and inverse tone-mapping (SR-ITM) aims to enhance...
research
12/13/2021

Text Gestalt: Stroke-Aware Scene Text Image Super-Resolution

In the last decade, the blossom of deep learning has witnessed the rapid...
research
06/04/2023

ESTISR: Adapting Efficient Scene Text Image Super-resolution for Real-Scenes

While scene text image super-resolution (STISR) has yielded remarkable i...
research
05/27/2020

SPIN: Structure-Preserving Inner Offset Network for Scene Text Recognition

Arbitrary text appearance poses a great challenge in scene text recognit...

Please sign up or login with your details

Forgot password? Click here to reset