DPLM: A Deep Perceptual Spatial-Audio Localization Metric

05/29/2021
by   Pranay Manocha, et al.
0

Subjective evaluations are critical for assessing the perceptual realism of sounds in audio-synthesis driven technologies like augmented and virtual reality. However, they are challenging to set up, fatiguing for users, and expensive. In this work, we tackle the problem of capturing the perceptual characteristics of localizing sounds. Specifically, we propose a framework for building a general purpose quality metric to assess spatial localization differences between two binaural recordings. We model localization similarity by utilizing activation-level distances from deep networks trained for direction of arrival (DOA) estimation. Our proposed metric (DPLM) outperforms baseline metrics on correlation with subjective ratings on a diverse set of datasets, even without the benefit of any human-labeled training data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/24/2022

SAQAM: Spatial Audio Quality Assessment Metric

Audio quality assessment is critical for assessing the perceptual realis...
research
10/28/2020

DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality metric to evaluate Noise Suppressors

Human subjective evaluation is the gold standard to evaluate speech qual...
research
03/13/2018

3D Video Quality Assessment

A key factor in designing 3D systems is to understand how different visu...
research
01/13/2020

A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences

Assessment of many audio processing tasks relies on subjective evaluatio...
research
08/21/2019

Scoot: A Perceptual Metric for Facial Sketches

While it is trivial for humans to quickly assess the perceptual similari...
research
12/08/2022

A Data-driven Cognitive Salience Model for Objective Perceptual Audio Quality Assessment

Objective audio quality measurement systems often use perceptual models ...
research
02/09/2021

CDPAM: Contrastive learning for perceptual audio similarity

Many speech processing methods based on deep learning require an automat...

Please sign up or login with your details

Forgot password? Click here to reset