Optimal Condition Training for Target Source Separation

11/11/2022
by   Efthymios Tzinis, et al.
0

Recent research has shown remarkable performance in leveraging multiple extraneous conditional and non-mutually exclusive semantic concepts for sound source separation, allowing the flexibility to extract a given target source based on multiple different queries. In this work, we propose a new optimal condition training (OCT) method for single-channel target source separation, based on greedy parameter updates using the highest performing condition among equivalent conditions associated with a given target source. Our experiments show that the complementary information carried by the diverse semantic concepts significantly helps to disentangle and isolate sources of interest much more efficiently compared to single-conditioned models. Moreover, we propose a variation of OCT with condition refinement, in which an initial conditional vector is adapted to the given mixture and transformed to a more amenable representation for target source extraction. We showcase the effectiveness of OCT on diverse source separation experiments where it improves upon permutation invariant models with oracle assignment and obtains state-of-the-art performance in the more challenging task of text-based source separation, outperforming even dedicated text-only conditioned models.

READ FULL TEXT
research
04/07/2022

Heterogeneous Target Speech Separation

We introduce a new paradigm for single-channel target source separation ...
research
07/27/2023

Complete and separate: Conditional separation with missing target source attribute completion

Recent approaches in source separation leverage semantic information abo...
research
10/22/2020

LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation

Recent deep-learning approaches have shown that Frequency Transformation...
research
08/19/2019

Audio query-based music source separation

In recent years, music source separation has been one of the most intens...
research
10/21/2022

Adversarial Permutation Invariant Training for Universal Sound Separation

Universal sound separation consists of separating mixes with arbitrary s...
research
03/23/2020

Multi-channel U-Net for Music Source Separation

A fairly straightforward approach for music source separation is to trai...
research
07/30/2021

Speeding Up Permutation Invariant Training for Source Separation

Permutation invariant training (PIT) is a widely used training criterion...

Please sign up or login with your details

Forgot password? Click here to reset