SAM-IQA: Can Segment Anything Boost Image Quality Assessment?

07/10/2023
by   Xinpeng Li, et al.
0

Image Quality Assessment (IQA) is a challenging task that requires training on massive datasets to achieve accurate predictions. However, due to the lack of IQA data, deep learning-based IQA methods typically rely on pre-trained networks trained on massive datasets as feature extractors to enhance their generalization ability, such as the ResNet network trained on ImageNet. In this paper, we utilize the encoder of Segment Anything, a recently proposed segmentation model trained on a massive dataset, for high-level semantic feature extraction. Most IQA methods are limited to extracting spatial-domain features, while frequency-domain features have been shown to better represent noise and blur. Therefore, we leverage both spatial-domain and frequency-domain features by applying Fourier and standard convolutions on the extracted features, respectively. Extensive experiments are conducted to demonstrate the effectiveness of all the proposed components, and results show that our approach outperforms the state-of-the-art (SOTA) in four representative datasets, both qualitatively and quantitatively. Our experiments confirm the powerful feature extraction capabilities of Segment Anything and highlight the value of combining spatial-domain and frequency-domain features in IQA tasks. Code: https://github.com/Hedlen/SAM-IQA

READ FULL TEXT
research
04/20/2022

Multi-Scale Features and Parallel Transformers Based Image Quality Assessment

With the increase in multimedia content, the type of distortions associa...
research
09/06/2022

High Dynamic Range Image Quality Assessment Based on Frequency Disparity

In this paper, a novel and effective image quality assessment (IQA) algo...
research
12/01/2020

Deep Multi-Scale Features Learning for Distorted Image Quality Assessment

Image quality assessment (IQA) aims to estimate human perception based i...
research
03/29/2021

Motion Basis Learning for Unsupervised Deep Homography Estimation with Subspace Projection

In this paper, we introduce a new framework for unsupervised deep homogr...
research
07/15/2020

Learning to Parse Wireframes in Images of Man-Made Environments

In this paper, we propose a learning-based approach to the task of autom...
research
05/24/2023

Collaborative Auto-encoding for Blind Image Quality Assessment

Blind image quality assessment (BIQA) is a challenging problem with impo...
research
03/23/2022

Deep Frequency Filtering for Domain Generalization

Improving the generalization capability of Deep Neural Networks (DNNs) i...

Please sign up or login with your details

Forgot password? Click here to reset