MaxSR: Image Super-Resolution Using Improved MaxViT

07/14/2023
by   Bincheng Yang, et al.
0

While transformer models have been demonstrated to be effective for natural language processing tasks and high-level vision tasks, only a few attempts have been made to use powerful transformer models for single image super-resolution. Because transformer models have powerful representation capacity and the in-built self-attention mechanisms in transformer models help to leverage self-similarity prior in input low-resolution image to improve performance for single image super-resolution, we present a single image super-resolution model based on recent hybrid vision transformer of MaxViT, named as MaxSR. MaxSR consists of four parts, a shallow feature extraction block, multiple cascaded adaptive MaxViT blocks to extract deep hierarchical features and model global self-similarity from low-level features efficiently, a hierarchical feature fusion block, and finally a reconstruction block. The key component of MaxSR, i.e., adaptive MaxViT block, is based on MaxViT block which mixes MBConv with squeeze-and-excitation, block attention and grid attention. In order to achieve better global modelling of self-similarity in input low-resolution image, we improve block attention and grid attention in MaxViT block to adaptive block attention and adaptive grid attention which do self-attention inside each window across all grids and each grid across all windows respectively in the most efficient way. We instantiate proposed model for classical single image super-resolution (MaxSR) and lightweight single image super-resolution (MaxSR-light). Experiments show that our MaxSR and MaxSR-light establish new state-of-the-art performance efficiently.

READ FULL TEXT

page 2

page 5

page 8

research
10/20/2022

Single Image Super-Resolution Using Lightweight Networks Based on Swin Transformer

Image super-resolution reconstruction is an important task in the field ...
research
03/17/2023

SRFormer: Permuted Self-Attention for Single Image Super-Resolution

Previous works have shown that increasing the window size for Transforme...
research
06/04/2023

ESTISR: Adapting Efficient Scene Text Image Super-resolution for Real-Scenes

While scene text image super-resolution (STISR) has yielded remarkable i...
research
05/19/2023

Efficient Mixed Transformer for Single Image Super-Resolution

Recently, Transformer-based methods have achieved impressive results in ...
research
06/07/2022

Hierarchical Similarity Learning for Aliasing Suppression Image Super-Resolution

As a highly ill-posed issue, single image super-resolution (SISR) has be...
research
06/24/2022

Bilateral Network with Channel Splitting Network and Transformer for Thermal Image Super-Resolution

In recent years, the Thermal Image Super-Resolution (TISR) problem has b...
research
03/11/2023

Recursive Generalization Transformer for Image Super-Resolution

Transformer architectures have exhibited remarkable performance in image...

Please sign up or login with your details

Forgot password? Click here to reset