LSTM-based Video Quality Prediction Accounting for Temporal Distortions in Videoconferencing Calls

03/22/2023
by   Gabriel Mittag, et al.
0

Current state-of-the-art video quality models, such as VMAF, give excellent prediction results by comparing the degraded video with its reference video. However, they do not consider temporal distortions (e.g., frame freezes or skips) that occur during videoconferencing calls. In this paper, we present a data-driven approach for modeling such distortions automatically by training an LSTM with subjective quality ratings labeled via crowdsourcing. The videos were collected from live videoconferencing calls in 83 different network conditions. We applied QR codes as markers on the source videos to create aligned references and compute temporal features based on the alignment vectors. Using these features together with VMAF core features, our proposed model achieves a PCC of 0.99 on the validation set. Furthermore, our model outputs per-frame quality that gives detailed insight into the cause of video quality impairments. The VCM model and dataset are open-sourced at https://github.com/microsoft/Video_Call_MOS.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/03/2022

BVI-VFI: A Video Quality Database for Video Frame Interpolation

Video frame interpolation (VFI) is a fundamental research topic in video...
research
06/19/2020

Capturing Video Frame Rate Variations via Entropic Differencing

High frame rate videos are increasingly getting popular in recent years,...
research
10/26/2020

ST-GREED: Space-Time Generalized Entropic Differences for Frame Rate Dependent Video Quality Prediction

We consider the problem of conducting frame rate dependent video quality...
research
07/29/2020

DNN No-Reference PSTN Speech Quality Prediction

Classic public switched telephone networks (PSTN) are often a black box ...
research
04/13/2018

SpatioTemporal Feature Integration and Model Fusion for Full Reference Video Quality Assessment

Perceptual video quality assessment models are either frame-based or vid...
research
07/26/2021

Temporal Alignment Prediction for Few-Shot Video Classification

The goal of few-shot video classification is to learn a classification m...
research
05/11/2023

Undercover Deepfakes: Detecting Fake Segments in Videos

The recent renaissance in generative models, driven primarily by the adv...

Please sign up or login with your details

Forgot password? Click here to reset