Danna-Sep: Unite to separate them all

12/07/2021
by Chin-Yun Yu, et al.

Deep learning-based music source separation has attracted considerable interest over the past decade. Most existing methods operate on either spectrograms or waveforms. Spectrogram-based models learn masks that separate the magnitude spectrogram of a mixture into different sources, while waveform-based models directly generate the waveforms of the individual sources. The two types of models have complementary strengths: the former performs better on harmonic sources such as vocals, while the latter gives better results for percussion and bass instruments. In this work, we improve upon the state-of-the-art (SoTA) models and combine the best of both worlds. The proposed framework, dubbed Danna-Sep, is built on three backbones: two spectrogram-based models, a modified X-UMX and a U-Net, and one waveform-based model, an enhanced Demucs. Given an input mixture, we linearly combine the respective outputs of the three models to obtain the final result. Our experiments show that, despite its simplicity, Danna-Sep surpasses the SoTA models by a large margin in terms of Source-to-Distortion Ratio.
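
The linear combination step described in the abstract can be illustrated with a minimal sketch. The abstract states only that the outputs of the three models are linearly combined; the function name `blend_estimates`, the array layout (sources, channels, samples), the use of one weight per model and source, and the weight values below are all assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def blend_estimates(estimates, weights):
    """Linearly combine per-model source estimates.

    estimates: list of arrays, one per model, each shaped
               (sources, channels, samples). This layout is an assumption.
    weights:   array shaped (models, sources); one weight per model and
               source. The values are hypothetical, not the paper's.
    """
    stacked = np.stack(estimates, axis=0)  # (models, sources, channels, samples)
    w = np.asarray(weights, dtype=stacked.dtype)
    if w.shape != stacked.shape[:2]:
        raise ValueError("weights must have shape (models, sources)")
    # Weighted sum over the model axis, done separately for every source.
    return np.einsum("ms,mscn->scn", w, stacked)


# Stand-ins for the outputs of the three models (random data, 4 sources,
# stereo, 8 samples). Real estimates would come from the trained models.
rng = np.random.default_rng(0)
estimates = [rng.standard_normal((4, 2, 8)) for _ in range(3)]

# Hypothetical weights; each column (one source) sums to 1.
weights = np.array([
    [0.2, 0.2, 0.2, 0.5],
    [0.1, 0.2, 0.4, 0.3],
    [0.7, 0.6, 0.4, 0.2],
])

mixed = blend_estimates(estimates, weights)
```

Allowing a separate weight per source reflects the observation that spectrogram-based and waveform-based models are stronger on different instruments, but how the weights are actually chosen is not specified in the abstract.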

research
11/27/2019

Music Source Separation in the Waveform Domain

Source separation for music is the task of isolating contributions, or s...
research
04/19/2022

Music Source Separation with Generative Flow

Full supervision models for source separation are trained on mixture-sou...
research
09/03/2019

Demucs: Deep Extractor for Music Sources with extra unlabeled data remixed

We study the problem of source separation for music using deep learning ...
research
11/05/2021

Hybrid Spectrogram and Waveform Source Separation

Source separation models either work on the spectrogram or waveform doma...
research
03/23/2021

Learned complex masks for multi-instrument source separation

Music source separation in the time-frequency domain is commonly achieve...
research
06/30/2022

Implicit Neural Spatial Filtering for Multichannel Source Separation in the Waveform Domain

We present a single-stage causal waveform-to-waveform multichannel model...
research
01/08/2020

Automatic Melody Harmonization with Triad Chords: A Comparative Study

Several prior works have proposed various methods for the task of automa...