Blind Mask to Improve Intelligibility of Non-Stationary Noisy Speech

08/20/2020
by F. Farias, et al.

This letter proposes a novel blind acoustic mask (BAM) designed to adaptively detect noise components and preserve target speech segments in the time domain. A robust standard deviation estimator is applied to the non-stationary noisy speech to identify noise masking elements. The main contribution of the proposed solution is the use of these noise statistics to derive adaptive information that defines and selects the samples with a lower proportion of noise, thereby preserving speech intelligibility. Additionally, this non-ideal mask requires no prior information about the statistics of the target speech or noise signals. The BAM and three competing methods, Ideal Binary Mask (IBM), Target Binary Mask (TBM), and Non-stationary Noise Estimation for Speech Enhancement (NNESE), are evaluated on speech signals corrupted by three non-stationary acoustic noises at six values of signal-to-noise ratio (SNR). Results demonstrate that the BAM technique achieves intelligibility gains comparable to those of the ideal masks while maintaining good speech quality.
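To make the general idea concrete, the following is a minimal sketch of a blind time-domain mask driven by a robust standard deviation estimate. It is not the BAM algorithm from the letter, whose details are not given in this abstract. Everything specific here is an assumption made for illustration: the median absolute deviation (MAD) as the robust estimator, the fixed frame length `frame_len`, the threshold factor `k`, the binary keep/zero decision, and the function names `robust_std` and `blind_time_mask`.

```python
import numpy as np


def robust_std(x):
    """MAD-based robust estimate of the standard deviation.

    The factor 1.4826 makes the estimate consistent for Gaussian data.
    Using the MAD is an assumption of this sketch, not the letter's estimator.
    """
    med = np.median(x)
    return 1.4826 * np.median(np.abs(x - med))


def blind_time_mask(noisy, frame_len=256, k=2.0):
    """Hypothetical blind mask: keep samples whose magnitude exceeds
    k times the robust standard deviation of their frame, zero the rest.

    Only the noisy signal is used; no clean speech or noise reference
    is required. frame_len and k are illustrative choices.
    """
    noisy = np.asarray(noisy, dtype=float)
    mask = np.zeros(noisy.shape, dtype=bool)
    for start in range(0, len(noisy), frame_len):
        frame = noisy[start:start + frame_len]
        sigma = robust_std(frame)
        # Samples at or below the threshold are treated as noise-dominated.
        mask[start:start + frame_len] = np.abs(frame) > k * sigma
    return mask, noisy * mask
```

In this sketch the decision depends only on statistics computed from the noisy signal itself, which is what makes the mask "blind". By contrast, ideal masks such as the IBM are built from the separate target and noise signals, which are not available in practice.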


