Evaluating generative audio systems and their metrics

08/31/2022
by   Ashvala Vinay, et al.
0

Recent years have seen considerable advances in audio synthesis with deep generative models. However, the state-of-the-art is very difficult to quantify; different studies often use different evaluation methodologies and different metrics when reporting results, making a direct comparison to other systems difficult if not impossible. Furthermore, the perceptual relevance and meaning of the reported metrics in most cases unknown, prohibiting any conclusive insights with respect to practical usability and audio quality. This paper presents a study that investigates state-of-the-art approaches side-by-side with (i) a set of previously proposed objective metrics for audio reconstruction, and with (ii) a listening study. The results indicate that currently used objective metrics are insufficient to describe the perceptual quality of current systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/19/2023

What You Hear Is What You See: Audio Quality Metrics From Image Quality Metrics

In this study, we investigate the feasibility of utilizing state-of-the-...
research
12/22/2021

Perceptual Evaluation of 360 Audiovisual Quality and Machine Learning Predictions

In an earlier study, we gathered perceptual evaluations of the audio, vi...
research
06/22/2022

A Study on the Evaluation of Generative Models

Implicit generative models, which do not return likelihood values, such ...
research
08/03/2021

A Benchmarking Initiative for Audio-Domain Music Generation Using the Freesound Loop Dataset

This paper proposes a new benchmark task for generat-ing musical passage...
research
07/06/2021

A Multi-Objective Approach for Sustainable Generative Audio Models

In recent years, the deep learning community has largely focused on the ...
research
12/02/2022

Can we still use PEAQ? A Performance Analysis of the ITU Standard for the Objective Assessment of Perceived Audio Quality

The Perceptual Evaluation of Audio Quality (PEAQ) method as described in...
research
04/01/2020

Improving Perceptual Quality of Drum Transcription with the Expanded Groove MIDI Dataset

Classifier metrics, such as accuracy and F-measure score, often serve as...

Please sign up or login with your details

Forgot password? Click here to reset