Pop Music Highlighter: Marking the Emotion Keypoints

02/28/2018
by   Yu-Siang Huang, et al.
0

The goal of music highlight extraction is to get a short consecutive segment of a piece of music that provides an effective representation of the whole piece. In a previous work, we introduced an attention-based convolutional recurrent neural network that uses music emotion classification as a surrogate task for music highlight extraction, for Pop songs. The rationale behind that approach is that the highlight of a song is usually the most emotional part. This paper extends our previous work in the following two aspects. First, methodology-wise we experiment with a new architecture that does not need any recurrent layers, making the training process faster. Moreover, we compare a late-fusion variant and an early-fusion variant to study which one better exploits the attention mechanism. Second, we conduct and report an extensive set of experiments comparing the proposed attention-based methods against a heuristic energy-based method, a structural repetition-based method, and a few other simple feature-based methods for this task. Due to the lack of public-domain labeled data for highlight extraction, following our previous work we use the RWC POP 100-song data set to evaluate how the detected highlights overlap with any chorus sections of the songs. The experiments demonstrate the effectiveness of our methods over competing methods. For reproducibility, we open source the code and pre-trained model at https://github.com/remyhuang/pop-music-highlighter/.

READ FULL TEXT
research
12/16/2017

Automatic Music Highlight Extraction using Convolutional Recurrent Attention Networks

Music highlights are valuable contents for music services. Most methods ...
research
09/14/2019

musicnn: Pre-trained convolutional neural networks for music audio tagging

Pronounced as "musician", the musicnn library contains a set of pre-trai...
research
10/21/2020

AttendAffectNet: Self-Attention based Networks for Predicting Affective Responses from Movies

In this work, we propose different variants of the self-attention based ...
research
08/22/2020

A Efficient Multimodal Framework for Large Scale Emotion Recognition by Fusing Music and Electrodermal Activity Signals

Considerable attention has been paid for physiological signal-based emot...
research
04/12/2022

ADFF: Attention Based Deep Feature Fusion Approach for Music Emotion Recognition

Music emotion recognition (MER), a sub-task of music information retriev...
research
06/06/2023

Emotion-Conditioned Melody Harmonization with Hierarchical Variational Autoencoder

Existing melody harmonization models have made great progress in improvi...
research
02/12/2020

Constructing a Highlight Classifier with an Attention Based LSTM Neural Network

Data is being produced in larger quantities than ever before in human hi...

Please sign up or login with your details

Forgot password? Click here to reset