Are Nearby Neighbors Relatives?: Diagnosing Deep Music Embedding Spaces

04/15/2019
by Jaehun Kim, et al.

Deep neural networks have frequently been used to learn task-relevant representations directly from raw input data. In terms of overall performance metrics, machine learning solutions that employ deep representations have frequently been reported to greatly outperform those that use hand-crafted feature representations. At the same time, they may pick up on aspects that are predominant in the data, yet not actually meaningful or interpretable. In this paper, we therefore propose a systematic way to diagnose the trustworthiness of deep music representations with respect to musical semantics. The underlying assumption is that, if a deep representation is to be trusted, distance consistency between known related points should be maintained in both the input audio space and the corresponding latent deep space. We generate known related points through semantically meaningful transformations, considering both imperceptible and graver transformations. Then, we examine within- and between-space distance consistencies for both the audio space and the latent embedded space, the latter resulting from either a conventional feature extractor or a deep encoder. We illustrate how our method, as a complement to task-specific performance, provides interpretable insight into what a network may have captured from training data signals.
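The distance-consistency idea in the abstract can be illustrated with a small sketch. The abstract does not state the concrete measures the paper uses, so everything below is an assumption for illustration only: random vectors stand in for audio clips, additive noise stands in for a semantically meaningful transformation, a fixed random projection (`encode`) stands in for a feature extractor or deep encoder, a nearest-neighbor check serves as the within-space measure, and a Spearman-style rank correlation serves as the between-space measure.

```python
import numpy as np

def pairwise_dist(a, b):
    """Euclidean distance between every row of a and every row of b."""
    diff = a[:, None, :] - b[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

def within_space_consistency(originals, transformed):
    """Fraction of transformed points whose nearest original is their own source."""
    d = pairwise_dist(transformed, originals)
    return float(np.mean(d.argmin(axis=1) == np.arange(len(originals))))

def between_space_consistency(d_a, d_b):
    """Rank correlation between two distance matrices (ties are not averaged)."""
    def ranks(v):
        return np.argsort(np.argsort(v.ravel()))
    return float(np.corrcoef(ranks(d_a), ranks(d_b))[0, 1])

rng = np.random.default_rng(0)
x = rng.normal(size=(50, 256))                 # assumed stand-in for audio clips
w = rng.normal(size=(256, 16)) / np.sqrt(256)  # hypothetical encoder weights

def encode(a):
    """Hypothetical encoder: fixed random projection followed by tanh."""
    return np.tanh(a @ w)

results = {}
# Noise scale is an assumed proxy for transformation severity.
for name, scale in [("slight", 0.01), ("grave", 2.0)]:
    x_t = x + scale * rng.normal(size=x.shape)
    z, z_t = encode(x), encode(x_t)
    results[name] = {
        "within_input": within_space_consistency(x, x_t),
        "within_latent": within_space_consistency(z, z_t),
        "between": between_space_consistency(
            pairwise_dist(x_t, x), pairwise_dist(z_t, z)
        ),
    }
    print(name, results[name])
```

Under these assumptions, a representation would look trustworthy when a slightly transformed clip remains the nearest neighbor of its source in both spaces and the distance rankings of the two spaces agree. A drop in either score as the transformation grows more severe would then call for closer inspection.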

research
06/15/2020

COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations

Audio representation learning based on deep neural networks (DNNs) emerg...
research
12/14/2017

DLR : Toward a deep learned rhythmic representation for music content analysis

In the use of deep neural networks, it is crucial to provide appropriate...
research
12/14/2017

Towards Deep Modeling of Music Semantics using EEG Regularizers

Modeling of music audio semantics has been previously tackled through le...
research
04/18/2019

Inspecting and Interacting with Meaningful Music Representations using VAE

Variational Autoencoders (VAEs) have already achieved great results on im...
research
05/24/2023

Sound Design Strategies for Latent Audio Space Explorations Using Deep Learning Architectures

The research in Deep Learning applications in sound and music computing ...
research
12/11/2020

Analysis of Feature Representations for Anomalous Sound Detection

In this work, we thoroughly evaluate the efficacy of pretrained neural n...
research
07/13/2019

Learning Complex Basis Functions for Invariant Representations of Audio

Learning features from data has shown to be more successful than using h...
