Dual-Stage Low-Complexity Reconfigurable Speech Enhancement

05/17/2021
by Jun Yang, et al.

This paper proposes a dual-stage, low-complexity, reconfigurable technique for enhancing speech contaminated by various types of noise sources. Driven by the input data and audio content, the proposed approach performs coarse processing in the first stage and fine processing in the second stage. We demonstrate that the proposed solution significantly improves the metrics of the 3-fold QUality Evaluation of Speech in Telecommunication (3QUEST), which consist of the speech mean opinion score (SMOS) and the noise MOS (NMOS), for both near-field and far-field applications. The proposed approach also greatly improves both the signal-to-noise ratio (SNR) and the subjective listening experience. By comparison, traditional speech enhancement methods increase NMOS and SNR but reduce SMOS. In addition, the proposed scheme can be readily adopted in both the capture path and the speech render path of speech communication and conferencing systems, as well as in voice-trigger applications.
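As a rough illustration of the SNR metric mentioned in the abstract, the sketch below computes SNR in decibels from a clean reference and a processed signal. It is not the paper's evaluation procedure: the function name `snr_db`, the synthetic 440 Hz tone, the 16 kHz sample rate, and the scaled-noise "enhanced" signal are all assumptions chosen only to make the example self-contained.

```python
import math
import random


def snr_db(clean, processed):
    """SNR in dB, treating (processed - clean) as the residual noise."""
    signal_power = sum(c * c for c in clean) / len(clean)
    noise_power = sum((p - c) ** 2 for p, c in zip(processed, clean)) / len(clean)
    return 10.0 * math.log10(signal_power / noise_power)


# Hypothetical test signals: a 440 Hz tone sampled at 16 kHz plus Gaussian noise.
random.seed(0)
sample_rate = 16000
clean = [math.sin(2.0 * math.pi * 440.0 * n / sample_rate) for n in range(sample_rate)]
noise = [random.gauss(0.0, 1.0) for _ in range(sample_rate)]

# "noisy" is the input; "enhanced" is a stand-in for an enhancer's output,
# modeled here simply as the same noise attenuated by a factor of 5.
noisy = [c + 0.5 * n for c, n in zip(clean, noise)]
enhanced = [c + 0.1 * n for c, n in zip(clean, noise)]

snr_in = snr_db(clean, noisy)
snr_out = snr_db(clean, enhanced)
print(f"input SNR:   {snr_in:.2f} dB")
print(f"output SNR:  {snr_out:.2f} dB")
print(f"improvement: {snr_out - snr_in:.2f} dB")
```

Because the residual noise in this toy setup is the same sequence scaled by 1/5, the improvement equals 20·log10(5), about 14 dB. A gain of this kind says nothing about speech distortion, which is why the abstract reports SMOS alongside NMOS and SNR.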

research
06/19/2022

GMM based multi-stage Wiener filtering for low SNR speech enhancement

This paper proposes a single-channel speech enhancement method to reduce...
research
09/17/2019

A scalable noisy speech dataset and online subjective test framework

Background noise is a major source of quality impairments in Voice over ...
research
01/29/2020

Environment-aware Reconfigurable Noise Suppression

The paper proposes an efficient, robust, and reconfigurable technique to...
research
03/21/2023

ICASSP 2023 Deep Noise Suppression Challenge

Deep Speech Enhancement Challenge is the 5th edition of deep noise suppr...
research
04/09/2019

Speech Enhancement with Wide Residual Networks in Reverberant Environments

This paper proposes a speech enhancement method which exploits the high ...
research
09/03/2023

Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement

Speech emotion recognition (SER) often experiences reduced performance d...
research
08/18/2019

A Dual-Staged Context Aggregation Method Towards Efficient End-To-End Speech Enhancement

In speech enhancement, an end-to-end deep neural network converts a nois...
