Multi-Task Deep Residual Echo Suppression with Echo-aware Loss

02/14/2022
by Shimin Zhang, et al.

This paper introduces the NWPU Team's entry to the ICASSP 2022 AEC Challenge. We take a hybrid approach that cascades a linear AEC with a neural post-filter: the former handles the linear echo components, while the latter suppresses the residual non-linear echo components. We use a gated convolutional F-T-LSTM neural network (GFTNN) as the backbone and shape the post-filter with a multi-task learning (MTL) framework, in which a voice activity detection (VAD) module is adopted as an auxiliary task alongside echo suppression, with the aim of avoiding over-suppression that may cause speech distortion. Moreover, we adopt an echo-aware loss function, in which the mean square error (MSE) loss is optimized individually for every time-frequency bin (TF-bin) according to the signal-to-echo ratio (SER), leading to further suppression of the echo. An extensive ablation study shows that the time delay estimation (TDE) module in the neural post-filter leads to better perceptual quality, and that an adaptive filter with better convergence brings consistent performance gains for the post-filter. In addition, we find that using the linear echo as the input of our neural post-filter is a better choice than using the reference signal directly. In the ICASSP 2022 AEC Challenge, our approach ranked 1st on word accuracy (WAcc) (0.817) and 3rd on both mean opinion score (MOS) (4.502) and the final score (0.864).
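The idea of an SER-dependent MSE can be illustrated with a small sketch. The abstract does not give the exact weighting rule, so everything below is an assumption made for illustration: the function name `echo_aware_mse`, the parameter `alpha`, and the weight `1 + alpha / (1 + SER)` are hypothetical and are not taken from the paper. The sketch only shows the general shape of a per-TF-bin weighted MSE in which echo-dominated bins (low SER) contribute more to the loss.

```python
import numpy as np

def echo_aware_mse(est, target, echo, alpha=1.0, eps=1e-8):
    """Per-TF-bin weighted MSE (illustrative; weighting rule is an assumption).

    est, target, echo: magnitude spectrograms of identical shape (frames x bins).
    alpha: hypothetical extra weight applied to fully echo-dominated bins.
    """
    # Linear-scale signal-to-echo ratio for every TF-bin.
    ser = target ** 2 / (echo ** 2 + eps)
    # Weight lies in (1, 1 + alpha]: close to 1 where speech dominates,
    # close to 1 + alpha where echo dominates.
    weight = 1.0 + alpha / (1.0 + ser)
    return float(np.mean(weight * (est - target) ** 2))

# Toy spectrograms: one frame, two frequency bins.
est = np.array([[1.0, 0.0]])
target = np.array([[0.0, 0.0]])   # no near-end speech
echo = np.array([[1.0, 1.0]])     # echo present in both bins
loss = echo_aware_mse(est, target, echo)
```

With this weighting, leftover energy in bins that contain only echo is penalized more heavily than the same error in speech-dominated bins, which is one plausible way to push a post-filter toward stronger echo suppression without changing the loss where near-end speech is strong.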

research
09/29/2020

Residual acoustic echo suppression based on efficient multi-task convolutional neural network

Acoustic echo degrades the user experience in voice communication system...
research
02/17/2021

Weighted Recursive Least Square Filter and Neural Network based Residual Echo Suppression for the AEC-Challenge

This paper presents a real-time Acoustic Echo Cancellation (AEC) algorit...
research
03/11/2023

TaylorAECNet: A Taylor Style Neural Network for Full-Band Echo Cancellation

This paper describes aecX team's entry to the ICASSP 2023 acoustic echo ...
research
10/12/2020

Enhancement Of Coded Speech Using a Mask-Based Post-Filter

The quality of speech codecs deteriorates at low bitrates due to high qu...
research
12/30/2020

An Efficient QP Variable Convolutional Neural Network Based In-loop Filter for Intra Coding

In this paper, a novel QP variable convolutional neural network based in...
research
05/19/2020

Acoustic Echo Cancellation by Combining Adaptive Digital Filter and Recurrent Neural Network

Acoustic Echo Cancellation (AEC) plays a key role in voice interaction. ...
research
06/24/2021

A Simultaneous Denoising and Dereverberation Framework with Target Decoupling

Background noise and room reverberation are regarded as two major factor...