HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution

07/31/2023
by   Minyi Zhao, et al.
0

Scene text image super-resolution (STISR) is an important pre-processing technique for text recognition from low-resolution scene images. Nowadays, various methods have been proposed to extract text-specific information from high-resolution (HR) images to supervise STISR model training. However, due to uncontrollable factors (e.g. shooting equipment, focus, and environment) in manually photographing HR images, the quality of HR images cannot be guaranteed, which unavoidably impacts STISR performance. Observing the quality issue of HR images, in this paper we propose a novel idea to boost STISR by first enhancing the quality of HR images and then using the enhanced HR images as supervision to do STISR. Concretely, we develop a new STISR framework, called High-Resolution ENhancement (HiREN) that consists of two branches and a quality estimation module. The first branch is developed to recover the low-resolution (LR) images, and the other is an HR quality enhancement branch aiming at generating high-quality (HQ) text images based on the HR images to provide more accurate supervision to the LR images. As the degradation from HQ to HR may be diverse, and there is no pixel-level supervision for HQ image generation, we design a kernel-guided enhancement network to handle various degradation, and exploit the feedback from a recognizer and text-level annotations as weak supervision signal to train the HR enhancement branch. Then, a quality estimation module is employed to evaluate the qualities of HQ images, which are used to suppress the erroneous supervision information by weighting the loss of each image. Extensive experiments on TextZoom show that HiREN can work well with most existing STISR methods and significantly boost their performances.

READ FULL TEXT

page 1

page 2

page 4

page 8

page 10

research
06/29/2021

Text Prior Guided Scene Text Image Super-resolution

Scene text image super-resolution (STISR) aims to improve the resolution...
research
08/13/2023

TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image Super-Resolution

The goal of scene text image super-resolution is to reconstruct high-res...
research
12/13/2021

Text Gestalt: Stroke-Aware Scene Text Image Super-Resolution

In the last decade, the blossom of deep learning has witnessed the rapid...
research
04/29/2022

C3-STISR: Scene Text Image Super-resolution with Triple Clues

Scene text image super-resolution (STISR) has been regarded as an import...
research
07/19/2023

Towards Robust Scene Text Image Super-resolution via Explicit Location Enhancement

Scene text image super-resolution (STISR), aiming to improve image quali...
research
10/13/2022

Scene Text Image Super-Resolution via Content Perceptual Loss and Criss-Cross Transformer Blocks

Text image super-resolution is a unique and important task to enhance re...
research
06/24/2021

Unsupervised Deep Image Stitching: Reconstructing Stitched Features to Images

Traditional feature-based image stitching technologies rely heavily on f...

Please sign up or login with your details

Forgot password? Click here to reset