MRET: Multi-resolution Transformer for Video Quality Assessment

03/13/2023
by   Junjie Ke, et al.
0

No-reference video quality assessment (NR-VQA) for user generated content (UGC) is crucial for understanding and improving visual experience. Unlike video recognition tasks, VQA tasks are sensitive to changes in input resolution. Since large amounts of UGC videos nowadays are 720p or above, the fixed and relatively small input used in conventional NR-VQA methods results in missing high-frequency details for many videos. In this paper, we propose a novel Transformer-based NR-VQA framework that preserves the high-resolution quality information. With the multi-resolution input representation and a novel multi-resolution patch sampling mechanism, our method enables a comprehensive view of both the global video composition and local high-resolution details. The proposed approach can effectively aggregate quality information across different granularities in spatial and temporal dimensions, making the model robust to input resolution variations. Our method achieves state-of-the-art performance on large-scale UGC VQA datasets LSVQ and LSVQ-1080p, and on KoNViD-1k and LIVE-VQC without fine-tuning.

READ FULL TEXT

page 1

page 2

page 5

page 6

page 11

research
07/06/2022

FAST-VQA: Efficient End-to-end Video Quality Assessment with Fragment Sampling

Current deep video quality assessment (VQA) methods are usually with hig...
research
10/11/2022

Neighbourhood Representative Sampling for Efficient End-to-end Video Quality Assessment

The increased resolution of real-world videos presents a dilemma between...
research
09/19/2022

Panoramic Vision Transformer for Saliency Detection in 360° Videos

360^∘ video saliency detection is one of the challenging benchmarks for ...
research
04/13/2023

Zoom-VQA: Patches, Frames and Clips Integration for Video Quality Assessment

Video quality assessment (VQA) aims to simulate the human perception of ...
research
08/22/2021

StarVQA: Space-Time Attention for Video Quality Assessment

The attention mechanism is blooming in computer vision nowadays. However...
research
06/21/2023

StarVQA+: Co-training Space-Time Attention for Video Quality Assessment

Self-attention based Transformer has achieved great success in many comp...
research
09/23/2019

sZoom: A Framework for Automatic Zoom into High Resolution Surveillance Videos

Current cameras are capable of recording high resolution video. While vi...

Please sign up or login with your details

Forgot password? Click here to reset