DMF-Net: A decoupling-style multi-band fusion model for real-time full-band speech enhancement

03/01/2022
by   Guochen Yu, et al.
0

Full-band speech enhancement based on deep neural networks is still challenging for the difficulty of modeling more frequency bands and real-time implementation. Previous studies usually adopt compressed full-band speech features in Bark and ERB scale with relatively low frequency resolution, leading to degraded performance, especially in the high-frequency region. In this paper, we propose a decoupling-style multi-band fusion model to perform full-band speech denoising and dereverberation. Instead of optimizing the full-band speech by a single network structure, we decompose the full-band target into multi sub bands and then employ a multi-stage chain optimization strategy to estimate clean spectrum stage by stage. Specifically, the low- (0-8 kHz), middle- (8-16 kHz), and high-frequency (16-24 kHz) regions are mapped by three separate sub-networks and are then fused to obtain the full-band clean target STFT spectrum. Comprehensive experiments on two public datasets demonstrate that the proposed method outperforms previous advanced systems and yields promising performance in terms of speech quality and intelligibility in real complex scenarios.

READ FULL TEXT

page 1

page 4

research
11/16/2021

S-DCCRN: Super Wide Band DCCRN with learnable complex feature for speech enhancement

In speech enhancement, complex neural network has shown promising perfor...
research
02/05/2022

Optimization of a Real-Time Wavelet-Based Algorithm for Improving Speech Intelligibility

The optimization of a wavelet-based algorithm to improve speech intellig...
research
10/21/2019

Multi-Band Multi-Resolution Fully Convolutional Neural Networks for Singing Voice Separation

Deep neural networks with convolutional layers usually process the entir...
research
01/19/2023

THLNet: two-stage heterogeneous lightweight network for monaural speech enhancement

In this paper, we propose a two-stage heterogeneous lightweight network ...
research
04/27/2021

DPT-FSNet:Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement

Recently, dual-path networks have achieved promising performance due to ...
research
03/11/2023

TaylorAECNet: A Taylor Style Neural Network for Full-Band Echo Cancellation

This paper describes aecX team's entry to the ICASSP 2023 acoustic echo ...
research
05/29/2020

Sub-band Knowledge Distillation Framework for Speech Enhancement

In single-channel speech enhancement, methods based on full-band spectra...

Please sign up or login with your details

Forgot password? Click here to reset