Rethinking Super-Resolution as Text-Guided Details Generation

07/14/2022
by   Chenxi Ma, et al.
0

Deep neural networks have greatly promoted the performance of single image super-resolution (SISR). Conventional methods still resort to restoring the single high-resolution (HR) solution only based on the input of image modality. However, the image-level information is insufficient to predict adequate details and photo-realistic visual quality facing large upscaling factors (x8, x16). In this paper, we propose a new perspective that regards the SISR as a semantic image detail enhancement problem to generate semantically reasonable HR image that are faithful to the ground truth. To enhance the semantic accuracy and the visual quality of the reconstructed image, we explore the multi-modal fusion learning in SISR by proposing a Text-Guided Super-Resolution (TGSR) framework, which can effectively utilize the information from the text and image modalities. Different from existing methods, the proposed TGSR could generate HR image details that match the text descriptions through a coarse-to-fine process. Extensive experiments and ablation studies demonstrate the effect of the TGSR, which exploits the text reference to recover realistic images.

READ FULL TEXT

page 2

page 4

page 6

page 7

page 8

research
12/18/2016

Learning a No-Reference Quality Metric for Single-Image Super-Resolution

Numerous single-image super-resolution algorithms have been proposed in ...
research
06/29/2021

Text Prior Guided Scene Text Image Super-resolution

Scene text image super-resolution (STISR) aims to improve the resolution...
research
08/23/2019

DRFN: Deep Recurrent Fusion Network for Single-Image Super-Resolution with Large Factors

Recently, single-image super-resolution has made great progress owing to...
research
05/29/2019

Towards Real Scene Super-Resolution with Raw Images

Most existing super-resolution methods do not perform well in real scena...
research
04/21/2020

Weakly Aligned Joint Cross-Modality Super Resolution

Non-visual imaging sensors are widely used in the industry for different...
research
06/22/2022

A Fast Text-Driven Approach for Generating Artistic Content

In this work, we propose a complete framework that generates visual art....
research
03/08/2020

Domain-Specific Image Super-Resolution with Progressive Adversarial Network

Single Image Super-Resolution (SISR) aims to improve resolution of small...

Please sign up or login with your details

Forgot password? Click here to reset