Towards Explainable Music Emotion Recognition: The Route via Mid-level Features

07/08/2019
by   Shreyan Chowdhury, et al.
9

Emotional aspects play an important part in our interaction with music. However, modelling these aspects in MIR systems have been notoriously challenging since emotion is an inherently abstract and subjective experience, thus making it difficult to quantify or predict in the first place, and to make sense of the predictions in the next. In an attempt to create a model that can give a musically meaningful and intuitive explanation for its predictions, we propose a VGG-style deep neural network that learns to predict emotional characteristics of a musical piece together with (and based on) human-interpretable, mid-level perceptual features. We compare this to predicting emotion directly with an identical network that does not take into account the mid-level features and observe that the loss in predictive performance of going through the mid-level features is surprisingly low, on average. The design of our network allows us to visualize the effects of perceptual features on individual emotion predictions, and we argue that the small loss in performance in going through the mid-level features is justified by the gain in explainability of the predictions.

READ FULL TEXT

page 4

page 6

research
03/03/2023

Decoding and Visualising Intended Emotion in an Expressive Piano Performance

Expert musicians can mould a musical piece to convey specific emotions t...
research
07/28/2021

On Perceived Emotion in Expressive Piano Performance: Further Experimental Evidence for the Relevance of Mid-level Perceptual Features

Despite recent advances in audio content-based music emotion recognition...
research
06/14/2021

Tracing Back Music Emotion Predictions to Sound Sources and Intuitive Perceptual Qualities

Music emotion recognition is an important task in MIR (Music Information...
research
05/28/2019

Two-level Explanations in Music Emotion Recognition

Current ML models for music emotion recognition, while generally working...
research
06/13/2018

A data-driven approach to mid-level perceptual musical feature modeling

Musical features and descriptors could be coarsely divided into three le...
research
05/24/2022

Singer Identification for Metaverse with Timbral and Middle-Level Perceptual Features

Metaverse is an interactive world that combines reality and virtuality, ...
research
08/16/2018

Neural Networks Assist Crowd Predictions in Discerning the Veracity of Emotional Expressions

Crowd predictions have demonstrated powerful performance in predicting f...

Please sign up or login with your details

Forgot password? Click here to reset