Fréchet Audio Distance: A Metric for Evaluating Music Enhancement Algorithms

12/20/2018
by   Kevin Kilgour, et al.
0

We propose the Fréchet Audio Distance (FAD), a novel, reference-free evaluation metric for music enhancement algorithms. We demonstrate how typical evaluation metrics for speech enhancement and blind source separation can fail to accurately measure the perceived effect of a wide variety of distortions. As an alternative, we propose adapting the Fréchet Inception Distance (FID) metric used to evaluate generative image models to the audio domain. FAD is validated using a wide variety of artificial distortions and is compared to the signal based metrics signal to distortion ratio (SDR), cosine distance and magnitude L2 distance. We show that, with a correlation coefficient of 0.52, FAD correlates more closely with human perception than either SDR, cosine distance or magnitude L2 distance, with correlation coefficients of 0.39, -0.15 and -0.01 respectively.

READ FULL TEXT

page 6

page 7

page 8

page 9

page 10

page 13

page 14

page 17

research
02/16/2022

On loss functions and evaluation metrics for music source separation

We investigate which loss functions provide better separations via bench...
research
11/27/2018

Improved Speech Enhancement with the Wave-U-Net

We study the use of the Wave-U-Net architecture for speech enhancement, ...
research
05/27/2021

An Improved Measure of Musical Noise Based on Spectral Kurtosis

Audio processing methods operating on a time-frequency representation of...
research
08/23/2022

Parameter Sensitivity of Deep-Feature based Evaluation Metrics for Audio Textures

Standard evaluation metrics such as the Inception score and Fréchet Audi...
research
04/28/2022

Music Enhancement via Image Translation and Vocoding

Consumer-grade music recordings such as those captured by mobile devices...
research
05/09/2022

Improved Evaluation and Generation of Grid Layouts using Distance Preservation Quality and Linear Assignment Sorting

Images sorted by similarity enables more images to be viewed simultaneou...
research
06/25/2020

Dialogue Enhancement in Object-based Audio – Evaluating the Benefit on People above 65

Due to age-related hearing loss, elderly people often struggle with foll...

Please sign up or login with your details

Forgot password? Click here to reset