Multi-Scale Attention Neural Network for Acoustic Echo Cancellation

05/31/2021
by Lu Ma, et al.

Acoustic echo cancellation (AEC) plays a key role in speech interaction by suppressing the echo that reaches the microphone through acoustic reverberation of the loudspeaker signal. Because the performance of a linear adaptive filter (AF) degrades severely under the nonlinear distortion, background noise, and microphone clipping found in real scenarios, deep learning has been adopted for AEC for its strong nonlinear modelling ability. In this paper, we construct an end-to-end multi-scale attention neural network for AEC. A temporal convolution first transforms the waveform into a spectrogram. The spectrograms of the far-end reference and the near-end mixture are concatenated and fed to a temporal convolution network (TCN) with stacked dilated convolution layers. An attention mechanism is applied across the representations from the different layers, adaptively extracting relevant features by referring to the previous hidden state of the encoder long short-term memory (LSTM) unit. The representations are combined by weighted averaging and fed to the encoder LSTM to estimate the near-end speech. Experiments show the superiority of our method in terms of echo return loss enhancement (ERLE) for single-talk periods and perceptual evaluation of speech quality (PESQ) score for double-talk periods, in scenarios with background noise and nonlinear distortion.
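
The multi-scale attention step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the scoring function, so scaled dot-product scoring is assumed here, as is the premise that each layer's feature vector has the same dimension as the LSTM hidden state. The names `attend_over_scales` and `erle_db` are hypothetical. The second helper computes ERLE using its standard definition (ratio of microphone power to residual power, in dB) for an echo-only, single-talk segment.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend_over_scales(layer_feats, h_prev):
    """Weight per-layer features for one frame by relevance to h_prev.

    layer_feats: one feature vector per dilated-convolution layer (scale).
    h_prev: previous LSTM hidden state, used as the attention query.
    Assumption: scaled dot-product scoring; the paper may score differently.
    Returns (weights, context), where context is the weighted average that
    would be fed to the LSTM at this frame.
    """
    dim = len(h_prev)
    scores = [
        sum(f * h for f, h in zip(feat, h_prev)) / math.sqrt(dim)
        for feat in layer_feats
    ]
    weights = softmax(scores)
    context = [
        sum(w * feat[k] for w, feat in zip(weights, layer_feats))
        for k in range(dim)
    ]
    return weights, context


def erle_db(mic, residual):
    """Echo return loss enhancement in dB for a single-talk segment.

    mic: microphone samples containing only echo.
    residual: the same segment after echo cancellation.
    """
    mic_power = sum(x * x for x in mic)
    residual_power = sum(x * x for x in residual)
    return 10.0 * math.log10(mic_power / residual_power)


# Toy frame: three scales with two-dimensional features.
layer_feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
h_prev = [2.0, 0.0]
weights, context = attend_over_scales(layer_feats, h_prev)

# Toy single-talk segment: the residual has one tenth the mic amplitude.
erle = erle_db([1.0, -1.0], [0.1, -0.1])
```

In the toy frame, the first and third scales align with `h_prev` and receive equal, larger weights than the second. In a full model the weights would be recomputed at every frame from the updated hidden state, which is what lets the network change the scale it emphasizes over time.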

research
05/31/2021

EchoFilter: End-to-End Neural Network for Acoustic Echo Cancellation

Acoustic Echo Cancellation (AEC) whose aim is to suppress the echo origi...
research
05/31/2021

Multi-Scale Temporal Convolution Network for Classroom Voice Detection

Teaching with the cooperation of expert teacher and assistant teacher, w...
research
05/31/2021

Noise Classification Aided Attention-Based Neural Network for Monaural Speech Enhancement

This paper proposes a noise type classification aided attention-based n...
research
03/23/2022

FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement

Previously proposed FullSubNet has achieved outstanding performance in D...
research
06/25/2021

Nonlinear Acoustic Echo Cancellation with Deep Learning

We propose a nonlinear acoustic echo cancellation system, which aims to ...
research
01/13/2021

End-to-End Speaker Height and age estimation using Attention Mechanism with LSTM-RNN

Automatic height and age estimation of speakers using acoustic features ...
research
03/01/2018

TSSD: Temporal Single-Shot Detector Based on Attention and LSTM for Robotic Intelligent Perception

Temporal object detection has attracted significant attention, but most ...
