Towards Robust Scene Text Image Super-resolution via Explicit Location Enhancement

07/19/2023
by   Hang Guo, et al.
0

Scene text image super-resolution (STISR), aiming to improve image quality while boosting downstream scene text recognition accuracy, has recently achieved great success. However, most existing methods treat the foreground (character regions) and background (non-character regions) equally in the forward process, and neglect the disturbance from the complex background, thus limiting the performance. To address these issues, in this paper, we propose a novel method LEMMA that explicitly models character regions to produce high-level text-specific guidance for super-resolution. To model the location of characters effectively, we propose the location enhancement module to extract character region features based on the attention map sequence. Besides, we propose the multi-modal alignment module to perform bidirectional visual-semantic alignment to generate high-quality prior guidance, which is then incorporated into the super-resolution branch in an adaptive manner using the proposed adaptive fusion module. Experiments on TextZoom and four scene text recognition benchmarks demonstrate the superiority of our method over other state-of-the-art methods. Code is available at https://github.com/csguoh/LEMMA.

READ FULL TEXT

page 1

page 7

research
04/29/2022

C3-STISR: Scene Text Image Super-resolution with Triple Clues

Scene text image super-resolution (STISR) has been regarded as an import...
research
12/13/2021

Text Gestalt: Stroke-Aware Scene Text Image Super-Resolution

In the last decade, the blossom of deep learning has witnessed the rapid...
research
08/13/2023

TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image Super-Resolution

The goal of scene text image super-resolution is to reconstruct high-res...
research
07/21/2023

Character Time-series Matching For Robust License Plate Recognition

Automatic License Plate Recognition (ALPR) is becoming a popular study a...
research
09/27/2016

Blind Facial Image Quality Enhancement using Non-Rigid Semantic Patches

We propose to combine semantic data and registration algorithms to solve...
research
07/31/2023

HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution

Scene text image super-resolution (STISR) is an important pre-processing...
research
03/26/2023

Learning Generative Structure Prior for Blind Text Image Super-resolution

Blind text image super-resolution (SR) is challenging as one needs to co...

Please sign up or login with your details

Forgot password? Click here to reset