Transformer for Image Quality Assessment

12/30/2020
by Junyong You, et al.

The Transformer has become the new standard method in natural language processing (NLP), and it is also attracting research interest in computer vision. In this paper, we investigate the application of the Transformer to image quality assessment (Transformer in Image Quality, TRIQ). Following the original Transformer encoder employed in the Vision Transformer (ViT), we propose an architecture that places a shallow Transformer encoder on top of a feature map extracted by a convolutional neural network (CNN). Adaptive positional embedding is employed in the Transformer encoder to handle images of arbitrary resolution. Different settings of the Transformer architecture have been investigated on publicly available image quality databases. We have found that the proposed TRIQ architecture achieves outstanding performance. The implementation of TRIQ is published on GitHub (https://github.com/junyongyou/triq).
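
To make the input side of such an architecture concrete, the following is a minimal sketch in plain Python, not the authors' implementation. It shows how a CNN feature map of any spatial size could be flattened into a token sequence and combined with a positional embedding. The abstract does not say how the embedding is made adaptive, so the sketch assumes one possible scheme: a fixed table sized for the largest expected feature map, of which only the leading rows are used. The names make_positional_table, feature_map_to_tokens and add_adaptive_positions, the extra leading token (borrowed from ViT's class token), and all sizes are hypothetical.

```python
import random


def make_positional_table(max_len, dim, seed=0):
    """Stand-in for a learnable positional table of shape (max_len, dim)."""
    rng = random.Random(seed)
    return [[rng.uniform(-0.02, 0.02) for _ in range(dim)] for _ in range(max_len)]


def feature_map_to_tokens(feature_map):
    """Flatten an H x W x C feature map (nested lists) into H*W tokens of size C."""
    return [pixel for row in feature_map for pixel in row]


def add_adaptive_positions(tokens, pos_table, extra_token):
    """Prepend an extra token, then add the leading rows of pos_table.

    Assumption: "adaptive" is modelled here as truncating a table that was
    sized for the largest expected input; the abstract does not confirm this.
    """
    sequence = [extra_token] + tokens
    if len(sequence) > len(pos_table):
        raise ValueError("feature map exceeds the positional table length")
    # zip stops at the shorter input, so only the first len(sequence)
    # rows of the table are used.
    return [[t + p for t, p in zip(tok, pos)] for tok, pos in zip(sequence, pos_table)]


DIM = 4  # hypothetical channel count of the CNN feature map
pos_table = make_positional_table(max_len=1 + 8 * 8, dim=DIM)  # covers up to 8 x 8
extra = [0.0] * DIM  # stand-in for a learnable extra token

# Two feature maps of different spatial size, as from images of different resolution.
small = [[[1.0] * DIM for _ in range(3)] for _ in range(2)]  # 2 x 3 map
large = [[[1.0] * DIM for _ in range(6)] for _ in range(5)]  # 5 x 6 map

seq_small = add_adaptive_positions(feature_map_to_tokens(small), pos_table, extra)
seq_large = add_adaptive_positions(feature_map_to_tokens(large), pos_table, extra)
print(len(seq_small), len(seq_large))  # 7 31
```

Both sequences share the same positional table, so a Transformer encoder applied to them would need no change in its parameters when the input resolution changes. The encoder itself is omitted from this sketch.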


research
05/16/2023

Blind Image Quality Assessment via Transformer Predicted Error Map and Perceptual Quality Token

Image quality assessment is a fundamental problem in the field of image ...
research
01/30/2023

Half of an image is enough for quality assessment

Deep networks show promising performance in image quality assessment (IQ...
research
08/30/2021

A Battle of Network Structures: An Empirical Study of CNN, Transformer, and MLP

Convolutional neural networks (CNN) are the dominant deep neural network...
research
11/29/2021

On the rate of convergence of a classifier based on a Transformer encoder

Pattern recognition based on a high-dimensional predictor is considered....
research
07/20/2023

Hybrid Feature Embedding For Automatic Building Outline Extraction

Building outline extracted from high-resolution aerial images can be use...
research
01/04/2022

PyramidTNT: Improved Transformer-in-Transformer Baselines with Pyramid Architecture

Transformer networks have achieved great progress for computer vision ta...
research
05/17/2022

POViT: Vision Transformer for Multi-objective Design and Characterization of Nanophotonic Devices

We solve a fundamental challenge in semiconductor IC design: the fast an...
