Cross-modal Music Emotion Recognition Using Composite Loss-based Embeddings

12/14/2021
by Naoki Takashima, et al.

Most music emotion recognition approaches use one-way classification or regression that estimates a general emotion from a distribution of music samples, without considering emotional variations (e.g., happiness can be further categorised into high, moderate or low happiness). We propose a cross-modal music emotion recognition approach that associates music samples with emotions in a common space by considering both their general and specific characteristics. Since the association of music samples with emotions is uncertain due to subjective human perception, we compute composite loss-based embeddings that are trained to maximise two statistical characteristics: the correlation between music samples and emotions, based on canonical correlation analysis, and a probabilistic similarity between a music sample and an emotion, measured with KL-divergence. Experiments on two benchmark datasets demonstrate the superiority of our approach over one-way baselines. In addition, detailed analyses show that our approach achieves robust cross-modal music emotion recognition that not only identifies music samples matching a specific emotion but also detects the emotions expressed in a given music sample.
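
The abstract names the two ingredients of the composite loss but not their exact form, so the following is only a minimal NumPy sketch of how a correlation term and a KL-divergence term might be combined. Everything specific in it is an assumption rather than the authors' formulation: the function names, the diagonal-Gaussian form of the embeddings, the regulariser `reg`, the weight `alpha`, and the requirement that both modalities share the same embedding dimension for the KL term.

```python
import numpy as np


def canonical_correlations(x, y, reg=1e-4):
    """Canonical correlations between two views x (n, dx) and y (n, dy).

    They are the singular values of Sxx^{-1/2} Sxy Syy^{-1/2}.
    `reg` is a small ridge term (an assumption) that keeps the
    covariance matrices invertible.
    """
    n = x.shape[0]
    xc = x - x.mean(axis=0)
    yc = y - y.mean(axis=0)
    sxx = xc.T @ xc / (n - 1) + reg * np.eye(x.shape[1])
    syy = yc.T @ yc / (n - 1) + reg * np.eye(y.shape[1])
    sxy = xc.T @ yc / (n - 1)

    def inv_sqrt(m):
        # Inverse matrix square root of a symmetric positive-definite matrix.
        w, v = np.linalg.eigh(m)
        return v @ np.diag(w ** -0.5) @ v.T

    t = inv_sqrt(sxx) @ sxy @ inv_sqrt(syy)
    return np.linalg.svd(t, compute_uv=False)


def gaussian_kl(mu_p, var_p, mu_q, var_q):
    """KL(p || q) between diagonal Gaussians, summed over the last axis."""
    return 0.5 * np.sum(
        np.log(var_q / var_p) + (var_p + (mu_p - mu_q) ** 2) / var_q - 1.0,
        axis=-1,
    )


def composite_loss(music_mu, emotion_mu, music_var, emotion_var, alpha=1.0):
    """Hypothetical composite loss: high correlation and low KL are rewarded.

    Row i of each array is assumed to describe a matched music/emotion pair.
    """
    corr = canonical_correlations(music_mu, emotion_mu).sum()
    kl = gaussian_kl(music_mu, music_var, emotion_mu, emotion_var).mean()
    return -corr + alpha * kl
```

In a trainable model the two terms would be computed on the outputs of a music encoder and an emotion encoder and minimised by gradient descent; the sketch only evaluates the loss on fixed arrays.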


research
08/22/2020

Emotion-Based End-to-End Matching Between Image and Music in Valence-Arousal Space

Both images and music can convey rich semantics and are widely used to i...
research
01/06/2021

Transformer-based approach towards music emotion recognition from lyrics

The task of identifying emotions from a given music track has been an ac...
research
12/09/2021

Personalized musically induced emotions of not-so-popular Colombian music

This work presents an initial proof of concept of how Music Emotion Reco...
research
06/27/2021

Use of Variational Inference in Music Emotion Recognition

This work was developed aiming to employ Statistical techniques to the f...
research
11/26/2021

Emotion Embedding Spaces for Matching Music to Stories

Content creators often use music to enhance their stories, as it can be ...
research
04/19/2017

CNN based music emotion classification

Music emotion recognition (MER) is usually regarded as a multi-label tag...
research
02/20/2020

A Comparative Study of Western and Chinese Classical Music based on Soundscape Models

Whether literally or suggestively, the concept of soundscape is alluded ...
