Towards a perceptual distance metric for auditory stimuli

10/30/2020
by   Sarah Oh, et al.
0

Although perceptual (dis)similarity between sensory stimuli seems akin to distance, measuring the Euclidean distance between vector representations of auditory stimuli is a poor estimator of subjective dissimilarity. In hearing, nonlinear response patterns, interactions between stimulus components, temporal effects, and top-down modulation transform the information contained in incoming frequency-domain stimuli in a way that seems to preserve some notion of distance, but not that of familiar Euclidean space. This work proposes that transformations applied to auditory stimuli during hearing can be modeled as a function mapping stimulus points to their representations in a perceptual space, inducing a Riemannian distance metric. A dataset was collected in a subjective listening experiment, the results of which were used to explore approaches (biologically inspired, data-driven, and combinations thereof) to approximating the perceptual map. Each of the proposed measures achieved comparable or stronger correlations with subjective ratings (r   0.8) compared to state-of-the-art audio quality measures.

READ FULL TEXT

page 19

page 30

page 34

research
08/07/2023

Screen-based 3D Subjective Experiment Software

Recently, widespread 3D graphics (e.g., point clouds and meshes) have dr...
research
02/16/2020

Exploring crossmodal perceptual enhancement and integration in a sequence-reproducing task with cognitive priming

Leveraging the perceptual phenomenon of crossmoal correspondence has bee...
research
02/07/2020

Audio-Visual-Olfactory Resource Allocation for Tri-modal Virtual Environments

Virtual Environments (VEs) provide the opportunity to simulate a wide ra...
research
04/15/2019

Proximal binaural sound can induce subjective frisson

Sound frisson is a subjective experience wherein people tend to perceive...
research
07/13/2018

Towards Modeling the Interaction of Spatial-Associative Neural Network Representations for Multisensory Perception

Our daily perceptual experience is driven by different neural mechanisms...
research
09/22/2022

Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks

Automatically predicting the outcome of subjective listening tests is a ...

Please sign up or login with your details

Forgot password? Click here to reset