Deep Layered Learning in MIR

04/18/2018
by   Anders Elowsson, et al.
0

Deep learning has boosted the performance of many music information retrieval (MIR) systems in recent years. Yet, the complex hierarchical arrangement of music makes end-to-end learning hard for some MIR tasks - a very deep and structurally flexible processing chain is necessary to extract high-level features from a spectrogram representation. Mid-level representations such as tones, pitched onsets, chords, and beats are fundamental building blocks of music. This paper discusses how these can be used as intermediate representations in MIR to facilitate deep processing that generalizes well: each music concept is predicted individually in learning modules that are connected through latent representations in a directed acyclic graph. It is suggested that this strategy for inference, defined as deep layered learning (DLL), can help generalization by (1) - enforcing the validity of intermediate representations during processing, and by (2) - letting the inferred representations establish disentangled structures that support high-level invariant processing. A background to DLL and modular music processing is provided, and relevant concepts such as pruning, skip-connections, and layered performance supervision are reviewed.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/13/2017

A Tutorial on Deep Learning for Music Information Retrieval

Following their success in Computer Vision and other areas, deep learnin...
research
02/14/2019

Multimodal music information processing and retrieval: survey and future challenges

Towards improving the performance in various music information processin...
research
11/08/2018

Learning Disentangled Representations for Timber and Pitch in Music Audio

Timbre and pitch are the two main perceptual properties of musical sound...
research
07/29/2020

Music FaderNets: Controllable Music Generation Based On High-Level Features via Low-Level Feature Modelling

High-level musical qualities (such as emotion) are often abstract, subje...
research
06/13/2018

A data-driven approach to mid-level perceptual musical feature modeling

Musical features and descriptors could be coarsely divided into three le...
research
07/21/2022

Learning Unsupervised Hierarchies of Audio Concepts

Music signals are difficult to interpret from their low-level features, ...
research
06/02/2020

A Layered Learning Approach to Scaling in Learning Classifier Systems for Boolean Problems

Learning classifier systems (LCSs) originated from cognitive-science res...

Please sign up or login with your details

Forgot password? Click here to reset