Invariances and Data Augmentation for Supervised Music Transcription

11/13/2017
by   John Thickstun, et al.
0

This paper explores a variety of models for frame-based music transcription, with an emphasis on the methods needed to reach state-of-the-art on human recordings. The translation-invariant network discussed in this paper, which combines a traditional filterbank with a convolutional neural network, was the top-performing model in the 2017 MIREX Multiple Fundamental Frequency Estimation evaluation. This class of models shares parameters in the log-frequency domain, which exploits the frequency invariance of music to reduce the number of model parameters and avoid overfitting to the training data. All models in this paper were trained with supervision by labeled data from the MusicNet dataset, augmented by random label-preserving pitch-shift transformations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/17/2022

Extract fundamental frequency based on CNN combined with PYIN

This paper refers to the extraction of multiple fundamental frequencies ...
research
01/25/2020

The impact of Audio input representations on neural network based music transcription

This paper thoroughly analyses the effect of different input representat...
research
08/05/2020

Learning to Denoise Historical Music

We propose an audio-to-audio neural network model that learns to denoise...
research
06/23/2020

Incorporating Music Knowledge in Continual Dataset Augmentation for Music Generation

Deep learning has rapidly become the state-of-the-art approach for music...
research
04/22/2018

Tempo-Invariant Processing of Rhythm with Convolutional Neural Networks

Rhythm patterns can be performed with a wide variation of tempi. This pr...
research
11/12/2019

Music Auto-tagging Using CNNs and Mel-spectrograms With Reduced Frequency and Time Resolution

Automatic tagging of music is an important research topic in Music Infor...
research
10/27/2021

Exploring single-song autoencoding schemes for audio-based music structure analysis

The ability of deep neural networks to learn complex data relations and ...

Please sign up or login with your details

Forgot password? Click here to reset