Audio query-based music source separation

08/19/2019
by   Jie Hwan Lee, et al.
0

In recent years, music source separation has been one of the most intensively studied research areas in music information retrieval. Improvements in deep learning lead to a big progress in music source separation performance. However, most of the previous studies are restricted to separating a few limited number of sources, such as vocals, drums, bass, and other. In this study, we propose a network for audio query-based music source separation that can explicitly encode the source information from a query signal regardless of the number and/or kind of target signals. The proposed method consists of a Query-net and a Separator: given a query and a mixture, the Query-net encodes the query into the latent space, and the Separator estimates masks conditioned by the latent vector, which is then applied to the mixture for separation. The Separator can also generate masks using the latent vector from the training samples, allowing separation in the absence of a query. We evaluate our method on the MUSDB18 dataset, and experimental results show that the proposed method can separate multiple sources with a single network. In addition, through further investigation of the latent space we demonstrate that our method can generate continuous outputs via latent vector interpolation.

READ FULL TEXT

page 2

page 4

page 5

research
10/25/2021

Unsupervised Source Separation By Steering Pretrained Music Models

We showcase an unsupervised method that repurposes deep models trained f...
research
03/28/2022

Separate What You Describe: Language-Queried Audio Source Separation

In this paper, we introduce the task of language-queried audio source se...
research
11/26/2021

Learning source-aware representations of music in a discrete latent space

In recent years, neural network based methods have been proposed as a me...
research
11/14/2022

MedleyVox: An Evaluation Dataset for Multiple Singing Voices Separation

Separation of multiple singing voices into each voice is a rarely studie...
research
10/13/2021

Music Source Separation with Deep Equilibrium Models

While deep neural network-based music source separation (MSS) is very ef...
research
11/11/2022

Optimal Condition Training for Target Source Separation

Recent research has shown remarkable performance in leveraging multiple ...
research
03/07/2021

HTMD-Net: A Hybrid Masking-Denoising Approach to Time-Domain Monaural Singing Voice Separation

The advent of deep learning has led to the prevalence of deep neural net...

Please sign up or login with your details

Forgot password? Click here to reset