Towards a Perceptual Model for Estimating the Quality of Visual Speech

03/18/2022
by   Zakaria Aldeneh, et al.
0

Generating realistic lip motions to simulate speech production is key for driving natural character animations from audio. Previous research has shown that traditional metrics used to optimize and assess models for generating lip motions from speech are not a good indicator of subjective opinion of animation quality. Yet, running repetitive subjective studies for assessing the quality of animations can be time-consuming and difficult to replicate. In this work, we seek to understand the relationship between perturbed lip motion and subjective opinion of lip motion quality. Specifically, we adjust the degree of articulation for lip motion sequences and run a user-study to examine how this adjustment impacts the perceived quality of lip motion. We then train a model using the scores collected from our user-study to automatically predict the subjective quality of an animated sequence. Our results show that (1) users score lip motions with slight over-articulation the highest in terms of perceptual quality; (2) under-articulation had a more detrimental effect on perceived quality of lip motion compared to the effect of over-articulation; and (3) we can automatically estimate the subjective perceptual score for a given lip motion sequences with low error rates.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/26/2022

Naturalistic Head Motion Generation from Speech

Synthesizing natural head motion to accompany speech for an embodied con...
research
05/16/2022

Perceptual Evaluation on Audio-visual Dataset of 360 Content

To open up new possibilities to assess the multimodal perceptual quality...
research
03/13/2018

3D Video Quality Assessment

A key factor in designing 3D systems is to understand how different visu...
research
06/16/2022

To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets

In real life, room effect, also known as room reverberation, and the pre...
research
06/18/2023

MOSPC: MOS Prediction Based on Pairwise Comparison

As a subjective metric to evaluate the quality of synthesized speech, Me...
research
03/22/2021

A Perceptual Model of Musical Mix Clarity using Decomposition and Masking Thresholds

Objective measurement of perceptually motivated music attributes has app...
research
03/13/2018

Effect of Eye Dominance on the Perception of Stereoscopic 3D Video

Asymmetric schemes have widespread applications in the 3D video transmis...

Please sign up or login with your details

Forgot password? Click here to reset