GSEP: A robust vocal and accompaniment separation system using gated CBHG module and loudness normalization

10/23/2020
by   Soochul Park, et al.
0

In the field of audio signal processing research, source separation has been a popular research topic for a long time and the recent adoption of the deep neural networks have shown a significant improvement in performance. The improvement vitalizes the industry to productize audio deep learning based products and services including Karaoke in the music streaming apps and dialogue enhancement in the UHDTV. For these early markets, we defined a set of design principles of the vocal and accompaniment separation model in terms of robustness, quality, and cost. In this paper, we introduce GSEP (Gaudio source SEParation system), a robust vocal and accompaniment separation system using a Gated- CBHG module, mask warping, and loudness normalization and it was verified that the proposed system satisfies all three principles and outperforms the state-of-the-art systems both in objective measure and subjective assessment through experiments.

READ FULL TEXT
research
07/14/2021

Multi-Task Audio Source Separation

The audio source separation tasks, such as speech enhancement, speech se...
research
06/15/2022

On the Use of Deep Mask Estimation Module for Neural Source Separation Systems

Most of the recent neural source separation systems rely on a masking-ba...
research
03/24/2015

Probabilistic Binary-Mask Cocktail-Party Source Separation in a Convolutional Deep Neural Network

Separation of competing speech is a key challenge in signal processing a...
research
09/05/2023

A Generalized Bandsplit Neural Network for Cinematic Audio Source Separation

Cinematic audio source separation is a relatively new subtask of audio s...
research
01/15/2019

Spectrogram Feature Losses for Music Source Separation

In this paper we study deep learning-based music source separation, and ...
research
04/17/2018

The 2018 Signal Separation Evaluation Campaign

This paper reports the organization and results for the 2018 community-b...
research
06/25/2021

Online Self-Attentive Gated RNNs for Real-Time Speaker Separation

Deep neural networks have recently shown great success in the task of bl...

Please sign up or login with your details

Forgot password? Click here to reset