DECIBEL: Improving Audio Chord Estimation for Popular Music by Alignment and Integration of Crowd-Sourced Symbolic Representations

02/22/2020
by   Daphne Odekerken, et al.
0

Automatic Chord Estimation (ACE) is a fundamental task in Music Information Retrieval (MIR) and has applications in both music performance and MIR research. The task consists of segmenting a music recording or score and assigning a chord label to each segment. Although it has been a task in the annual benchmarking evaluation MIREX for over 10 years, ACE is not yet a solved problem, since performance has stagnated and modern systems have started to tune themselves to subjective training data. We propose DECIBEL, a new ACE system that exploits widely available MIDI and tab representations to improve ACE from audio only. From an audio file and a set of MIDI and tab files corresponding to the same popular music song, DECIBEL first estimates chord sequences. For audio, state-of-the-art audio ACE methods are used. MIDI files are aligned to the audio, followed by a MIDI chord estimation step. Tab files are transformed into untimed chord sequences and then aligned to the audio. Next, DECIBEL uses data fusion to integrate all estimated chord sequences into one final output sequence. DECIBEL improves all tested state-of-the-art ACE methods by over 3 percent on average. This result shows that the integration of musical knowledge from heterogeneous symbolic music representations is a suitable strategy for addressing challenging MIR tasks such as ACE.

READ FULL TEXT

page 15

page 16

page 18

page 23

page 31

page 32

page 33

page 40

research
07/27/2021

Audio-to-Score Alignment Using Deep Automatic Music Transcription

Audio-to-score alignment (A2SA) is a multimodal task consisting in the a...
research
12/08/2021

Learning music audio representations via weak language supervision

Audio representations for music information retrieval are typically lear...
research
06/21/2018

Learning Transposition-Invariant Interval Features from Symbolic Music and Audio

Many music theoretical constructs (such as scale types, modes, cadences,...
research
11/01/2021

Learning To Generate Piano Music With Sustain Pedals

Recent years have witnessed a growing interest in research related to th...
research
07/27/2021

PKSpell: Data-Driven Pitch Spelling and Key Signature Estimation

We present PKSpell: a data-driven approach for the joint estimation of p...
research
04/06/2022

Late multimodal fusion for image and audio music transcription

Music transcription, which deals with the conversion of music sources in...
research
04/11/2023

Soft Dynamic Time Warping for Multi-Pitch Estimation and Beyond

Many tasks in music information retrieval (MIR) involve weakly aligned d...

Please sign up or login with your details

Forgot password? Click here to reset