From Coarse to Fine: Hierarchical Pixel Integration for Lightweight Image Super-Resolution

11/30/2022
by   Jie Liu, et al.
0

Image super-resolution (SR) serves as a fundamental tool for the processing and transmission of multimedia data. Recently, Transformer-based models have achieved competitive performances in image SR. They divide images into fixed-size patches and apply self-attention on these patches to model long-range dependencies among pixels. However, this architecture design is originated for high-level vision tasks, which lacks design guideline from SR knowledge. In this paper, we aim to design a new attention block whose insights are from the interpretation of Local Attribution Map (LAM) for SR networks. Specifically, LAM presents a hierarchical importance map where the most important pixels are located in a fine area of a patch and some less important pixels are spread in a coarse area of the whole image. To access pixels in the coarse area, instead of using a very large patch size, we propose a lightweight Global Pixel Access (GPA) module that applies cross-attention with the most similar patch in an image. In the fine area, we use an Intra-Patch Self-Attention (IPSA) module to model long-range pixel dependencies in a local patch, and then a 3×3 convolution is applied to process the finest details. In addition, a Cascaded Patch Division (CPD) strategy is proposed to enhance perceptual quality of recovered images. Extensive experiments suggest that our method outperforms state-of-the-art lightweight SR methods by a large margin. Code is available at https://github.com/passerer/HPINet.

READ FULL TEXT

page 3

page 5

page 6

research
03/13/2022

Efficient Long-Range Attention Network for Image Super-resolution

Recently, transformer-based methods have demonstrated impressive results...
research
05/09/2022

Activating More Pixels in Image Super-Resolution Transformer

Transformer-based methods have shown impressive performance in low-level...
research
04/20/2023

Omni Aggregation Networks for Lightweight Image Super-Resolution

While lightweight ViT framework has made tremendous progress in image su...
research
10/02/2020

Efficient Image Super-Resolution Using Pixel Attention

This work aims at designing a lightweight convolutional neural network f...
research
08/30/2022

ASpanFormer: Detector-Free Image Matching with Adaptive Span Transformer

Generating robust and reliable correspondences across images is a fundam...
research
08/05/2023

Unfolding Once is Enough: A Deployment-Friendly Transformer Unit for Super-Resolution

Recent years have witnessed a few attempts of vision transformers for si...
research
03/03/2022

ViTransPAD: Video Transformer using convolution and self-attention for Face Presentation Attack Detection

Face Presentation Attack Detection (PAD) is an important measure to prev...

Please sign up or login with your details

Forgot password? Click here to reset