Perceptual Quality Assessment of Omnidirectional Audio-visual Signals

07/20/2023
by   Xilei Zhu, et al.
0

Omnidirectional videos (ODVs) play an increasingly important role in the application fields of medical, education, advertising, tourism, etc. Assessing the quality of ODVs is significant for service-providers to improve the user's Quality of Experience (QoE). However, most existing quality assessment studies for ODVs only focus on the visual distortions of videos, while ignoring that the overall QoE also depends on the accompanying audio signals. In this paper, we first establish a large-scale audio-visual quality assessment dataset for omnidirectional videos, which includes 375 distorted omnidirectional audio-visual (A/V) sequences generated from 15 high-quality pristine omnidirectional A/V contents, and the corresponding perceptual audio-visual quality scores. Then, we design three baseline methods for full-reference omnidirectional audio-visual quality assessment (OAVQA), which combine existing state-of-the-art single-mode audio and video QA models via multimodal fusion strategies. We validate the effectiveness of the A/V multimodal fusion method for OAVQA on our dataset, which provides a new benchmark for omnidirectional QoE evaluation. Our dataset is available at https://github.com/iamazxl/OAVQA.

READ FULL TEXT

page 4

page 5

research
03/04/2023

Audio-Visual Quality Assessment for User Generated Content: Database and Method

With the explosive increase of User Generated Content (UGC), UGC video q...
research
08/13/2020

Automatic Quality Assessment for Audio-Visual Verification Systems. The LOVe submission to NIST SRE Challenge 2019

Fusion of scores is a cornerstone of multimodal biometric systems compos...
research
08/07/2023

AudioVMAF: Audio Quality Prediction with VMAF

Video Multimethod Assessment Fusion (VMAF) [1], [2], [3] is a popular to...
research
09/08/2023

EGOFALLS: A visual-audio dataset and benchmark for fall detection using egocentric cameras

Falls are significant and often fatal for vulnerable populations such as...
research
08/13/2023

UGC Quality Assessment: Exploring the Impact of Saliency in Deep Feature-Based Quality Assessment

The volume of User Generated Content (UGC) has increased in recent years...
research
05/03/2023

"Glitch in the Matrix!": A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization

Most deepfake detection methods focus on detecting spatial and/or spatio...
research
03/24/2020

How deep is your encoder: an analysis of features descriptors for an autoencoder-based audio-visual quality metric

The development of audio-visual quality assessment models poses a number...

Please sign up or login with your details

Forgot password? Click here to reset