Audio Similarity is Unreliable as a Proxy for Audio Quality

06/27/2022
by   Pranay Manocha, et al.
0

Many audio processing tasks require perceptual assessment. However, the time and expense of obtaining “gold standard” human judgments limit the availability of such data. Most applications incorporate full reference or other similarity-based metrics (e.g. PESQ) that depend on a clean reference. Researchers have relied on such metrics to evaluate and compare various proposed methods, often concluding that small, measured differences imply one is more effective than another. This paper demonstrates several practical scenarios where similarity metrics fail to agree with human perception, because they: (1) vary with clean references; (2) rely on attributes that humans factor out when considering quality, and (3) are sensitive to imperceptible signal level differences. In those scenarios, we show that no-reference metrics do not suffer from such shortcomings and correlate better with human perception. We conclude therefore that similarity serves as an unreliable proxy for audio quality.

READ FULL TEXT
research
10/28/2020

DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality metric to evaluate Noise Suppressors

Human subjective evaluation is the gold standard to evaluate speech qual...
research
05/19/2023

What You Hear Is What You See: Audio Quality Metrics From Image Quality Metrics

In this study, we investigate the feasibility of utilizing state-of-the-...
research
10/26/2021

AQP: An Open Modular Python Platform for Objective Speech and Audio Quality Metrics

Audio quality assessment has been widely researched in the signal proces...
research
09/16/2021

NORESQA – A Framework for Speech Quality Assessment using Non-Matching References

The perceptual task of speech quality assessment (SQA) is a challenging ...
research
12/20/2022

DocAsRef: A Pilot Empirical Study on Repurposing Reference-Based Summary Quality Metrics Reference-Freely

Summary quality assessment metrics have two categories: reference-based ...
research
05/16/2020

Exploration of Audio Quality Assessment and Anomaly Localisation Using Attention Models

Many applications of speech technology require more and more audio data....
research
01/13/2020

A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences

Assessment of many audio processing tasks relies on subjective evaluatio...

Please sign up or login with your details

Forgot password? Click here to reset