Does Phase Matter For Monaural Source Separation?

11/02/2017
by Mohit Dubey, et al.

The "cocktail party" problem of fully separating multiple sources from a single-channel audio waveform remains unsolved. Current biological understanding of neural encoding suggests that phase information is preserved and used at every stage of the auditory pathway. However, most current computational approaches discard phase information and mask only the amplitude spectrogram of the sound. In this paper, we ask whether preserving phase information in spectral representations of sound improves monaural separation of vocals from a musical track, using a neurally plausible sparse generative model. Our results demonstrate that preserving phase information reduces artifacts in the separated tracks, as quantified by the signal-to-artifact ratio (GSAR). Furthermore, our proposed method achieves state-of-the-art performance for source separation, as quantified by a mean signal-to-interference ratio (GSIR) of 19.46.
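
To make the contrast in the abstract concrete, the following minimal sketch compares a magnitude-only mask (which reuses the mixture phase) with a phase-preserving complex mask on a synthetic two-tone mixture. It is not the sparse generative model the paper proposes: both masks are oracle masks computed from the known sources, and the test tones, the naive `stft` helper, and all parameter values are assumptions chosen purely for illustration.

```python
import numpy as np

def stft(x, n_fft=256, hop=64):
    """Naive Hann-windowed STFT; returns a (frames, bins) complex array."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack([x[i * hop:i * hop + n_fft] * window
                       for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

# Hypothetical sources: two tones close enough to share frequency bins.
sr = 8000
t = np.arange(sr) / sr
vocals = np.sin(2 * np.pi * 440 * t)
accomp = 0.8 * np.sin(2 * np.pi * 445 * t + 1.0)
mix = vocals + accomp

V, A, M = stft(vocals), stft(accomp), stft(mix)
eps = 1e-12

# Magnitude-only oracle ratio mask: scale the mixture magnitude and
# reuse the mixture phase, discarding the phase of the vocal source.
ratio_mask = np.abs(V) / (np.abs(V) + np.abs(A) + eps)
est_mag_only = ratio_mask * np.abs(M) * np.exp(1j * np.angle(M))

# Phase-preserving oracle complex mask: a complex ratio per bin,
# left at zero wherever the mixture bin is numerically empty.
complex_mask = np.divide(V, M, out=np.zeros_like(V), where=np.abs(M) > eps)
est_complex = complex_mask * M

# Spectral reconstruction error against the true vocal spectrogram.
err_mag_only = np.linalg.norm(est_mag_only - V)
err_complex = np.linalg.norm(est_complex - V)
print(f"magnitude-only error: {err_mag_only:.3f}")
print(f"complex-mask error:   {err_complex:.3e}")
```

Because the two tones overlap in frequency, the mixture phase differs from the vocal phase in the shared bins, so the magnitude-only estimate cannot match the target even with an oracle mask. The complex mask recovers the target spectrogram up to numerical precision. A learned system would have to estimate either mask from the mixture alone, so these errors only indicate what each representation can achieve in principle.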

research
11/03/2020

Complex ratio masking for singing voice separation

Music source separation is important for applications such as karaoke an...
research
07/07/2018

Improving DNN-based Music Source Separation using Phase Features

Music source separation with deep neural networks typically relies only ...
research
10/11/2021

Source Mixing and Separation Robust Audio Steganography

Audio steganography aims at concealing secret information in carrier aud...
research
12/14/2022

Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks

Emulating the human ability to solve the cocktail party problem, i.e., f...
research
03/13/2019

Phase-aware Harmonic/Percussive Source Separation via Convex Optimization

Decomposition of an audio mixture into harmonic and percussive component...
research
02/14/2020

Deep S^3PR: Simultaneous Source Separation and Phase Retrieval Using Deep Generative Models

This paper introduces and solves the simultaneous source separation and ...
research
03/14/2023

Multi-Channel Masking with Learnable Filterbank for Sound Source Separation

This work proposes a learnable filterbank based on a multi-channel maski...
