Adversarial Learning for Improved Onsets and Frames Music Transcription

06/20/2019
by   Jong Wook Kim, et al.
0

Automatic music transcription is considered to be one of the hardest problems in music information retrieval, yet recent deep learning approaches have achieved substantial improvements on transcription performance. These approaches commonly employ supervised learning models that predict various time-frequency representations, by minimizing element-wise losses such as the cross entropy function. However, applying the loss in this manner assumes conditional independence of each label given the input, and thus cannot accurately express inter-label dependencies. To address this issue, we introduce an adversarial training scheme that operates directly on the time-frequency representations and makes the output distribution closer to the ground-truth. Through adversarial learning, we achieve a consistent improvement in both frame-level and note-level metrics over Onsets and Frames, a state-of-the-art music transcription model. Our results show that adversarial learning can significantly reduce the error rate while increasing the confidence of the model estimations. Our approach is generic and applicable to any transcription model based on multi-label predictions, which are very common in music signal analysis.

READ FULL TEXT
research
12/04/2018

Singing Voice Separation Using a Deep Convolutional Neural Network Trained by Ideal Binary Mask and Cross Entropy

Separating a singing voice from its music accompaniment remains an impor...
research
11/12/2019

Music Auto-tagging Using CNNs and Mel-spectrograms With Reduced Frequency and Time Resolution

Automatic tagging of music is an important research topic in Music Infor...
research
01/16/2018

Automatic Classification of Music Genre using Masked Conditional Neural Networks

Neural network based architectures used for sound recognition are usuall...
research
01/15/2019

Classical Music Generation in Distinct Dastgahs with AlimNet ACGAN

In this paper AlimNet (With respect to great musician, Alim Qasimov) an ...
research
09/05/2017

ALICE: Towards Understanding Adversarial Learning for Joint Distribution Matching

We investigate the non-identifiability issues associated with bidirectio...
research
01/30/2023

DanceAnyWay: Synthesizing Mixed-Genre 3D Dance Movements Through Beat Disentanglement

We present DanceAnyWay, a hierarchical generative adversarial learning m...
research
10/20/2020

The Effect of Spectrogram Reconstruction on Automatic Music Transcription: An Alternative Approach to Improve Transcription Accuracy

Most of the state-of-the-art automatic music transcription (AMT) models ...

Please sign up or login with your details

Forgot password? Click here to reset