A Streamlined Encoder/Decoder Architecture for Melody Extraction

10/30/2018
by   Tsung-Han Hsieh, et al.
0

Melody extraction in polyphonic musical audio is important for music signal processing. In this paper, we propose a novel streamlined encoder/decoder network that is designed for the task. We make two technical contributions. First, drawing inspiration from a state-of-the-art model for semantic pixel-wise segmentation, we pass through the pooling indices between pooling and un-pooling layers to localize the melody in frequency. We can achieve result close to the state-of-the-art with much fewer convolutional layers and simpler convolution modules. Second, we propose a way to use the bottleneck layer of the network to estimate the existence of a melody line for each time frame, and make it possible to use a simple argmax function instead of ad-hoc thresholding to get the final estimation of the melody line. Our experiments on both vocal melody extraction and general melody extraction validate the effectiveness of the proposed model.

READ FULL TEXT
research
09/19/2023

Spatial-Assistant Encoder-Decoder Network for Real Time Semantic Segmentation

Semantic segmentation is an essential technology for self-driving cars t...
research
02/07/2018

Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation

Spatial pyramid pooling module or encode-decoder structure are used in d...
research
09/28/2022

CSSAM: U-net Network for Application and Segmentation of Welding Engineering Drawings

Heavy equipment manufacturing splits specific contours in drawings and c...
research
02/02/2022

TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music

Singing melody extraction is an important problem in the field of music ...
research
08/22/2019

Feedbackward Decoding for Semantic Segmentation

We propose a novel approach for semantic segmentation that uses an encod...
research
05/11/2019

Graph U-Nets

We consider the problem of representation learning for graph data. Convo...
research
07/24/2019

From Big to Small: Multi-Scale Local Planar Guidance for Monocular Depth Estimation

Estimating accurate depth from a single image is challenging, because it...

Please sign up or login with your details

Forgot password? Click here to reset