Melody Extraction from Polyphonic Music by Deep Learning Approaches: A Review

02/02/2022
by Gurunath Reddy M, et al.

Melody extraction is an important music information retrieval task, with potential applications in education and the music industry. It is also notoriously challenging: the background instruments interfere with the melodic source, and the melodic source often exhibits characteristics similar to those of the other instruments. The accompaniment that overlaps with the vocals makes extracting the melody from the mixture signal harder still. Until recently, classical signal processing-based methods were the dominant approach among melody extraction researchers. The ability of deep learning models to handle large-scale data and to learn features automatically by exploiting spatial and temporal dependencies has inspired many researchers to adopt them for melody extraction. This paper reviews recent data-driven deep learning approaches for melody extraction from polyphonic music. The available deep models are categorized by the type of neural network used and by the output representation they use for predicting melody. The architectures of 25 melody extraction models are briefly presented. The loss functions used to optimize the model parameters are grouped into four broad categories, and the loss functions used by the individual models are briefly described. The input representations adopted by the models and their parameter settings are described in detail. A section on the explainability of black-box melody extraction deep neural networks is included. The performance of the 25 melody extraction methods is compared. Possible future directions for exploring and improving melody extraction methods are also presented.


