Modeling Music Modality with a Key-Class Invariant Pitch Chroma CNN

06/17/2019
by   Anders Elowsson, et al.
0

This paper presents a convolutional neural network (CNN) that uses input from a polyphonic pitch estimation system to predict perceived minor/major modality in music audio. The pitch activation input is structured to allow the first CNN layer to compute two pitch chromas focused on different octaves. The following layers perform harmony analysis across chroma and time scales. Through max pooling across pitch, the CNN becomes invariant with regards to the key class (i.e., key disregarding mode) of the music. A multilayer perceptron combines the modality activation output with spectral features for the final prediction. The study uses a dataset of 203 excerpts rated by around 20 listeners each, a small challenging data size requiring a carefully designed parameter sharing. With an R2 of about 0.71, the system clearly outperforms previous systems as well as individual human listeners. A final ablation study highlights the importance of using pitch activations processed across longer time scales, and using pooling to facilitate invariance with regards to the key class.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/30/2019

Exploiting SIFT Descriptor for Rotation Invariant Convolutional Neural Network

This paper presents a novel approach to exploit the distinctive invarian...
research
02/27/2018

Convolutional Neural Network Achieves Human-level Accuracy in Music Genre Classification

Music genre classification is one example of content-based analysis of m...
research
04/01/2016

Good Practice in CNN Feature Transfer

The objective of this paper is the effective transfer of the Convolution...
research
10/10/2018

A Multimodal Approach towards Emotion Recognition of Music using Audio and Lyrical Content

We propose MoodNet - A Deep Convolutional Neural Network based architect...
research
04/22/2018

Tempo-Invariant Processing of Rhythm with Convolutional Neural Networks

Rhythm patterns can be performed with a wide variation of tempi. This pr...
research
12/17/2017

Using Deep learning methods for generation of a personalized list of shuffled songs

The shuffle mode, where songs are played in a randomized order that is d...

Please sign up or login with your details

Forgot password? Click here to reset