VTAMIQ: Transformers for Attention Modulated Image Quality Assessment

10/04/2021
by Andrei Chubarau, et al.

Following the major successes of self-attention and Transformers for image analysis, we investigate the use of such attention mechanisms in the context of Image Quality Assessment (IQA) and propose a novel full-reference IQA method, Vision Transformer for Attention Modulated Image Quality (VTAMIQ). Our method achieves competitive or state-of-the-art performance on the existing IQA datasets and significantly outperforms previous metrics in cross-database evaluations. Most patch-wise IQA methods treat each patch independently; this partially discards global information and limits the ability to model long-distance interactions. We avoid this problem altogether by employing a transformer to encode a sequence of patches as a single global representation, which by design considers interdependencies between patches. We rely on various attention mechanisms – first with self-attention within the Transformer, and second with channel attention within our difference modulation network – specifically to reveal and enhance the more salient features throughout our architecture. With large-scale pre-training for both classification and IQA tasks, VTAMIQ generalizes well to unseen sets of images and distortions, further demonstrating the strength of transformer-based networks for vision modelling.
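The pipeline described in the abstract (encode a sequence of patches into one global representation, then modulate the reference/distorted difference with channel attention) can be illustrated with a toy sketch. This is not the authors' implementation. It assumes a single attention head with identity projections, a class token initialised as the patch mean, an untrained sigmoid gate standing in for learned channel attention, and an L2 norm in place of a learned quality regressor. All function names (`encode_patches`, `modulate_difference`, `quality_distance`) are hypothetical.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def encode_patches(patches):
    """Pool a sequence of patch embeddings into one global vector.

    Assumption: one self-attention head with identity Q/K/V projections.
    The class token is initialised as the patch mean (a real model learns it),
    and only the class-token row of the attention output is returned.
    """
    dim = len(patches[0])
    cls = [sum(p[i] for p in patches) / len(patches) for i in range(dim)]
    tokens = [cls] + patches
    scale = math.sqrt(dim)
    # Scaled dot-product scores of the class token against every token.
    scores = [sum(q * k for q, k in zip(cls, t)) / scale for t in tokens]
    weights = softmax(scores)
    # Attention-weighted sum of tokens: every patch contributes to the result.
    return [sum(w * t[i] for w, t in zip(weights, tokens)) for i in range(dim)]


def modulate_difference(ref_vec, dist_vec):
    """Gate each channel of the difference vector.

    Assumption: an untrained sigmoid gate on the channel magnitude replaces
    the learned channel-attention weights of a real modulation network.
    """
    diff = [r - d for r, d in zip(ref_vec, dist_vec)]
    gates = [1.0 / (1.0 + math.exp(-abs(x))) for x in diff]
    return [g * x for g, x in zip(gates, diff)]


def quality_distance(ref_patches, dist_patches):
    """Toy full-reference score: L2 norm of the modulated difference."""
    modulated = modulate_difference(
        encode_patches(ref_patches), encode_patches(dist_patches)
    )
    return math.sqrt(sum(x * x for x in modulated))


reference = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
distorted = [[0.8, 0.1], [0.0, 0.7], [0.4, 0.6]]
print(quality_distance(reference, reference))  # identical inputs give 0.0
print(quality_distance(reference, distorted) > 0.0)
```

Because the class token attends over all patches at once, the pooled vector depends on every patch jointly rather than on each patch in isolation, which is the property the abstract contrasts with independent patch-wise scoring.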


