Spectrogram-Channels U-Net: A Source Separation Model Viewing Each Channel as the Spectrogram of Each Source

10/26/2018
by   Jaehoon Oh, et al.

Sound source separation is an active research topic in Music Information Retrieval (MIR), both because it is challenging in itself and because it is related to many other MIR tasks, such as automatic lyric transcription, singer identification, and voice conversion. In this paper, we propose an intuitive spectrogram-based model for source separation by adapting U-Net, which was originally proposed for biomedical image segmentation. We call it Spectrogram-Channels U-Net, meaning that each channel of the output corresponds to the spectrogram of one source. The proposed model can be used not only for singing voice separation but also for multi-instrument separation by changing only the number of output channels. In addition, we propose a loss function that balances the volume of the stems. Finally, the model achieves performance comparable to other state-of-the-art models on both separation tasks.
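The output layout and the idea of a volume-balanced loss can be illustrated with a small NumPy sketch. This is not the authors' implementation: the abstract does not give the loss formula, so the weighting used here (per-source L1 error, weighted inversely to each stem's mean magnitude) is only an assumption, and the names `volume_balanced_l1` and `eps` are hypothetical. The network itself is omitted; its output is represented as an array of shape `(S, F, T)`, with one channel per source.

```python
import numpy as np

def volume_balanced_l1(estimates, targets, eps=1e-8):
    """Weighted L1 loss over per-source spectrograms.

    estimates, targets: arrays of shape (S, F, T), where channel s holds
    the magnitude spectrogram of source s (S sources, F frequency bins,
    T time frames).

    ASSUMPTION: each weight is inversely proportional to the mean
    magnitude ("volume") of the corresponding target stem, normalised to
    sum to 1. The paper's actual weighting may differ.
    """
    # Mean absolute error of each source channel, shape (S,)
    per_source = np.mean(np.abs(estimates - targets), axis=(1, 2))
    # Mean magnitude of each target stem, shape (S,)
    volume = np.mean(np.abs(targets), axis=(1, 2)) + eps
    weights = 1.0 / volume
    weights = weights / weights.sum()
    return float(np.sum(weights * per_source)), weights

# Two output channels, e.g. vocals and accompaniment; a multi-instrument
# setup would only change S (the number of channels).
S, F, T = 2, 4, 3
targets = np.stack([np.full((F, T), 1.0),    # loud stem
                    np.full((F, T), 0.1)])   # quiet stem
estimates = np.zeros((S, F, T))              # stand-in for the network output
loss, weights = volume_balanced_l1(estimates, targets)
```

Under this assumed weighting, the quiet stem receives the larger weight, so its errors are not drowned out by those of the louder stem.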

research
03/05/2023

Hybrid Y-Net Architecture for Singing Voice Separation

This research paper presents a novel deep learning-based neural network ...
research
11/06/2019

The sound of my voice: speaker representation loss for target voice separation

Research on content and style representations has been widely studied in...
research
07/24/2022

Source Separation of Unknown Numbers of Single-Channel Underwater Acoustic Signals Based on Autoencoders

The separation of single-channel underwater acoustic signals is a challe...
research
11/06/2022

Preserving background sound in noise-robust voice conversion via multi-task learning

Background sound is an informative form of art that is helpful in provid...
research
02/17/2020

Meta-learning Extractors for Music Source Separation

We propose a hierarchical meta-learning-inspired model for music source ...
research
08/12/2020

Channel-wise Subband Input for Better Voice and Accompaniment Separation on High Resolution Music

This paper presents a new input format, channel-wise subband input (CWS)...
research
08/03/2022

Conv-NILM-Net, a causal and multi-appliance model for energy source separation

Non-Intrusive Load Monitoring (NILM) seeks to save energy by estimating ...
