Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction

08/04/2023
by   Keren Shao, et al.
0

In deep learning research, many melody extraction models rely on redesigning neural network architectures to improve performance. In this paper, we propose an input feature modification and a training objective modification based on two assumptions. First, harmonics in the spectrograms of audio data decay rapidly along the frequency axis. To enhance the model's sensitivity on the trailing harmonics, we modify the Combined Frequency and Periodicity (CFP) representation using discrete z-transform. Second, the vocal and non-vocal segments with extremely short duration are uncommon. To ensure a more stable melody contour, we design a differentiable loss function that prevents the model from predicting such segments. We apply these modifications to several models, including MSNet, FTANet, and a newly introduced model, PianoNet, modified from a piano transcription network. Our experimental results demonstrate that the proposed modifications are empirically effective for singing melody extraction.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/16/2018

Genre-Agnostic Key Classification With Convolutional Neural Networks

We propose modifications to the model structure and training procedure t...
research
08/06/2021

Simple Modifications to Improve Tabular Neural Networks

There is growing interest in neural network architectures for tabular da...
research
04/02/2022

Improving Target Sound Extraction with Timestamp Information

Target sound extraction (TSE) aims to extract the sound part of a target...
research
01/31/2023

Fourier Sensitivity and Regularization of Computer Vision Models

Recent work has empirically shown that deep neural networks latch on to ...
research
08/16/2018

Improved Chord Recognition by Combining Duration and Harmonic Language Models

Chord recognition systems typically comprise an acoustic model that pred...
research
02/02/2022

TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music

Singing melody extraction is an important problem in the field of music ...

Please sign up or login with your details

Forgot password? Click here to reset