LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation

10/22/2020
by   Woosung Choi, et al.
8

Recent deep-learning approaches have shown that Frequency Transformation (FT) blocks can significantly improve spectrogram-based single-source separation models by capturing frequency patterns. The goal of this paper is to extend the FT block to fit the multi-source task. We propose the Latent Source Attentive Frequency Transformation (LaSAFT) block to capture source-dependent frequency patterns. We also propose the Gated Point-wise Convolutional Modulation (GPoCM), an extension of Feature-wise Linear Modulation (FiLM), to modulate internal features. By employing these two novel methods, we extend the Conditioned-U-Net (CUNet) for multi-source separation, and the experimental results indicate that our LaSAFT and GPoCM can improve the CUNet's performance, achieving state-of-the-art SDR performance on several MUSDB18 source separation tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/24/2021

LightSAFT: Lightweight Latent Source Aware Frequency Transform for Source Separation

Conditioned source separations have attracted significant attention beca...
research
12/06/2020

Source Separation and Depthwise Separable Convolutions for Computer Audition

Given recent advances in deep music source separation, we propose a feat...
research
03/10/2023

Distribution Preserving Source Separation With Time Frequency Predictive Models

We provide an example of a distribution preserving source separation met...
research
11/11/2022

Optimal Condition Training for Target Source Separation

Recent research has shown remarkable performance in leveraging multiple ...
research
11/22/2022

Latent Iterative Refinement for Modular Source Separation

Traditional source separation approaches train deep neural network model...
research
11/16/2020

Block-Online Guided Source Separation

We propose a block-online algorithm of guided source separation (GSS). G...
research
10/25/2020

Unified Gradient Reweighting for Model Biasing with Applications to Source Separation

Recent deep learning approaches have shown great improvement in audio so...

Please sign up or login with your details

Forgot password? Click here to reset