Noise Classification Aided Attention-Based Neural Network for Monaural Speech Enhancement

05/31/2021
by   Lu Ma, et al.

This paper proposes a noise-type-classification-aided, attention-based neural network approach for monaural speech enhancement. The network builds on a previous work by adding a noise classification subnetwork and feeding the classification embedding into the attention mechanism, which guides the network toward better feature extraction. To make the network end-to-end, an audio encoder and decoder built from temporal convolutions transform between waveform and spectrogram. In addition, our model comprises two long short-term memory (LSTM) based encoders, two attention mechanisms, a noise classifier, and a speech mask generator. Experiments show that, compared with OM-LSA and the previous work, the proposed approach achieves better speech quality as measured by PESQ. More promisingly, our approach generalizes better to unseen noise conditions.
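
As a rough illustration of what "taking the classification embedding into the attention mechanism" could look like, the sketch below conditions a scaled dot-product attention step on a noise embedding. The abstract does not specify how the embedding is combined with the attention inputs, so everything here is an assumption: the function name `noise_aided_attention`, the additive conditioning of the query, the requirement that the embedding has the same dimension as the query, and the use of scaled dot-product scoring. It is not the authors' implementation.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def noise_aided_attention(query, keys, values, noise_embedding):
    """Attend over frames with a query shifted by a noise embedding.

    ASSUMPTION: the noise embedding is added to the query element-wise,
    so both must have the same dimension. The paper may combine them
    differently (e.g. concatenation followed by a learned projection).
    Returns (context_vector, attention_weights).
    """
    conditioned = [q + e for q, e in zip(query, noise_embedding)]
    scale = math.sqrt(len(conditioned))
    weights = softmax([dot(conditioned, k) / scale for k in keys])
    dim = len(values[0])
    context = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]
    return context, weights


# Hypothetical toy input: two frames with 2-dimensional features.
keys = [[1.0, 0.0], [0.0, 1.0]]
values = [[1.0, 2.0], [3.0, 4.0]]
context, weights = noise_aided_attention([0.0, 0.0], keys, values, [0.0, 2.0])
print(weights)  # the second frame receives the larger weight
```

In this toy setting the embedding alone steers attention toward the frame whose key aligns with it, which is the intuition behind letting a noise classifier guide feature extraction.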

research
08/27/2021

Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement

As the cornerstone of other important technologies, such as speech recog...
research
05/19/2020

Atss-Net: Target Speaker Separation via Attention-based Neural Network

Recently, Convolutional Neural Network (CNN) and Long short-term memory ...
research
09/02/2022

TB or not TB? Acoustic cough analysis for tuberculosis classification

In this work, we explore recurrent neural network architectures for tube...
research
05/31/2021

Multi-Scale Attention Neural Network for Acoustic Echo Cancellation

Acoustic Echo Cancellation (AEC) plays a key role in speech interaction ...
research
07/10/2019

Multi-layer Attention Mechanism for Speech Keyword Recognition

As an important part of speech recognition technology, automatic speech ...
research
07/18/2021

Residual Attention Based Network for Automatic Classification of Phonation Modes

Phonation mode is an essential characteristic of singing style as well a...
research
03/14/2023

TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 DNS Challenge

This paper introduces the Unbeatable Team's submission to the ICASSP 202...
