Learning Generalized Spatial-Temporal Deep Feature Representation for No-Reference Video Quality Assessment

12/27/2020
by   Baoliang Chen, et al.
3

In this work, we propose a no-reference video quality assessment method, aiming to achieve high-generalization capability in cross-content, -resolution and -frame rate quality prediction. In particular, we evaluate the quality of a video by learning effective feature representations in spatial-temporal domain. In the spatial domain, to tackle the resolution and content variations, we impose the Gaussian distribution constraints on the quality features. The unified distribution can significantly reduce the domain gap between different video samples, resulting in a more generalized quality feature representation. Along the temporal dimension, inspired by the mechanism of visual perception, we propose a pyramid temporal aggregation module by involving the short-term and long-term memory to aggregate the frame-level quality. Experiments show that our method outperforms the state-of-the-art methods on cross-dataset settings, and achieves comparable performance on intra-dataset configurations, demonstrating the high-generalization capability of the proposed method.

READ FULL TEXT

page 1

page 3

page 7

research
08/01/2019

Quality Assessment of In-the-Wild Videos

Quality assessment of in-the-wild videos is a challenging problem becaus...
research
01/15/2023

Learning Sparse Temporal Video Mapping for Action Quality Assessment in Floor Gymnastics

Athlete performance measurement in sports videos requires modeling long ...
research
06/20/2022

DisCoVQA: Temporal Distortion-Content Transformers for Video Quality Assessment

The temporal relationships between frames and their influences on video ...
research
11/17/2022

Generalizable Deepfake Detection with Phase-Based Motion Analysis

We propose PhaseForensics, a DeepFake (DF) video detection method that l...
research
08/19/2021

More for Less: Non-Intrusive Speech Quality Assessment with Limited Annotations

Non-intrusive speech quality assessment is a crucial operation in multim...
research
10/30/2017

Prediction of Satisfied User Ratio for Compressed Video

A large-scale video quality dataset called the VideoSet has been constru...
research
06/19/2020

Capturing Video Frame Rate Variations via Entropic Differencing

High frame rate videos are increasingly getting popular in recent years,...

Please sign up or login with your details

Forgot password? Click here to reset