Harmonic-Percussive Source Separation with Deep Neural Networks and Phase Recovery

07/30/2018
by   Konstantinos Drossos, et al.
0

Harmonic/percussive source separation (HPSS) consists in separating the pitched instruments from the percussive parts in a music mixture. In this paper, we propose to apply the recently introduced Masker-Denoiser with twin networks (MaD TwinNet) system to this task. MaD TwinNet is a deep learning architecture that has reached state-of-the-art results in monaural singing voice separation. Herein, we propose to apply it to HPSS by using it to estimate the magnitude spectrogram of the percussive source. Then, we retrieve the complex-valued short-time Fourier transform of the sources by means of a phase recovery algorithm, which minimizes the reconstruction error and enforces the phase of the harmonic part to follow a sinusoidal phase model. Experiments conducted on realistic music mixtures show that this novel separation system outperforms the previous state-of-the art kernel additive model approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/20/2020

Phase recovery with Bregman divergences for audio source separation

Time-frequency audio source separation is usually achieved by estimating...
research
11/03/2020

Complex ratio masking for singing voice separation

Music source separation is important for applications such as karaoke an...
research
03/13/2019

Phase-aware Harmonic/Percussive Source Separation via Convex Optimization

Decomposition of an audio mixture into harmonic and percussive component...
research
07/07/2018

Improving DNN-based Music Source Separation using Phase Features

Music source separation with deep neural networks typically relies only ...
research
11/22/2018

Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective

This study investigates phase reconstruction for deep learning based mon...
research
05/06/2019

Investigating kernel shapes and skip connections for deep learning-based harmonic-percussive separation

In this paper we propose an efficient deep learning encoder-decoder netw...
research
04/12/2019

Examining the Mapping Functions of Denoising Autoencoders in Music Source Separation

The goal of this work is to investigate what music source separation app...

Please sign up or login with your details

Forgot password? Click here to reset