Speech enhancement using ego-noise references with a microphone array embedded in an unmanned aerial vehicle

11/04/2022
by   Elisa Tengan, et al.
0

A method is proposed for performing speech enhancement using ego-noise references with a microphone array embedded in an unmanned aerial vehicle (UAV). The ego-noise reference signals are captured with microphones located near the UAV's propellers and used in the prior knowledge multichannel Wiener filter (PK-MWF) to obtain the speech correlation matrix estimate. Speech presence probability (SPP) can be estimated for detecting speech activity from an external microphone near the speech source, providing a performance benchmark, or from one of the embedded microphones, assuming a more realistic scenario. Experimental measurements are performed in a semi-anechoic chamber, with a UAV mounted on a stand and a loudspeaker playing a speech signal, while setting three distinct and fixed propeller rotation speeds, resulting in three different signal-to-noise ratios (SNRs). The recordings obtained and made available online are used to compare the proposed method to the use of the standard multichannel Wiener filter (MWF) estimated with and without the propellers' microphones being used in its formulation. Results show that compared to those, the use of PK-MWF achieves higher levels of improvement in speech intelligibility and quality, measured by STOI and PESQ, while the SNR improvement is similar.

READ FULL TEXT
research
10/07/2019

Impulsive Noise Detection for Intelligibility and Quality Improvement of Speech Enhancement Methods Applied in Time-Domain

This letter introduces a novel speech enhancement method in the Hilbert-...
research
06/19/2022

GMM based multi-stage Wiener filtering for low SNR speech enhancement

This paper proposes a single-channel speech enhancement method to reduce...
research
11/03/2018

Deep Ad-hoc Beamforming

Deep learning based speech enhancement methods face two problems. First,...
research
05/29/2019

Deep-Learning-Based Audio-Visual Speech Enhancement in Presence of Lombard Effect

When speaking in presence of background noise, humans reflexively change...
research
03/02/2021

DOANet: a deep dilated convolutional neural network approach for search and rescue with drone-embedded sound source localization

Drone-embedded sound source localization (SSL) has interesting applicati...
research
10/11/2017

PROSE: Perceptual Risk Optimization for Speech Enhancement

The goal in speech enhancement is to obtain an estimate of clean speech ...
research
12/19/2017

Flexible Stereo: Constrained, Non-rigid, Wide-baseline Stereo Vision for Fixed-wing Aerial Platforms

This paper proposes a computationally efficient method to estimate the t...

Please sign up or login with your details

Forgot password? Click here to reset