Resource-Efficient Speech Mask Estimation for Multi-Channel Speech Enhancement

07/22/2020
by   Lukas Pfeifenberger, et al.
Machine learning techniques are traditionally resource-intensive, but interest in hardware- and energy-efficient approaches is growing. This need for resource-efficient machine learning is driven primarily by the demand for embedded systems and their use in ubiquitous computing and IoT applications. In this article, we present a resource-efficient approach to multi-channel speech enhancement based on Deep Neural Networks (DNNs). In particular, we use reduced-precision DNNs to estimate a speech mask from noisy, multi-channel microphone observations. This speech mask is then used to obtain either the Minimum Variance Distortionless Response (MVDR) or the Generalized Eigenvalue (GEV) beamformer. In the extreme case of binary weights and reduced-precision activations, a significant reduction in execution time and memory footprint is possible, while audio quality remains almost on par with single-precision DNNs and the Word Error Rate (WER) is only slightly higher, for single-speaker scenarios using the WSJ0 speech corpus.
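
To make the step from speech mask to beamformer concrete, the following is a minimal NumPy sketch, not the article's implementation. It assumes the noise mask is one minus the speech mask, and it uses a commonly used reference-channel MVDR formulation that may differ from the one in the full text. The function names, array layouts, and the `eps` regularization are hypothetical choices made for illustration.

```python
import numpy as np

def masked_psd(Y, mask, eps=1e-10):
    """Mask-weighted spatial covariance (PSD) matrix per frequency bin.

    Y:    complex STFT, shape (F, T, C) = (frequency, time, channel)
    mask: real values in [0, 1], shape (F, T)
    Returns an array of shape (F, C, C).
    """
    # Weighted sum over time of the outer products y y^H.
    psd = np.einsum("ft,ftc,ftd->fcd", mask, Y, Y.conj())
    # Normalize by the total mask weight in each frequency bin.
    return psd / (mask.sum(axis=1)[:, None, None] + eps)

def mvdr_weights(psd_s, psd_n, ref=0, eps=1e-10):
    """Reference-channel MVDR weights, shape (F, C).

    Assumed formulation: w = (Phi_nn^-1 Phi_ss) u / trace(Phi_nn^-1 Phi_ss),
    where u selects the reference microphone `ref`.
    """
    C = psd_n.shape[-1]
    # Small diagonal loading keeps the noise PSD invertible.
    psd_n = psd_n + eps * np.eye(C)
    # Solve Phi_nn X = Phi_ss for each frequency bin (batched).
    num = np.linalg.solve(psd_n, psd_s)
    trace = np.trace(num, axis1=-2, axis2=-1)
    return num[..., ref] / (trace[:, None] + eps)

def apply_beamformer(w, Y):
    """Beamformer output w^H y for every time-frequency bin, shape (F, T)."""
    return np.einsum("fc,ftc->ft", w.conj(), Y)

def enhance(Y, speech_mask):
    """Estimate PSDs from the mask, then beamform the noisy STFT."""
    psd_s = masked_psd(Y, speech_mask)
    psd_n = masked_psd(Y, 1.0 - speech_mask)  # assumption: noise mask = 1 - speech mask
    return apply_beamformer(mvdr_weights(psd_s, psd_n), Y)
```

In this sketch, `speech_mask` stands in for the output of the mask-estimation DNN, whether single-precision or reduced-precision; only the mask quality changes, while the beamforming step stays the same.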

research
02/11/2021

Speech enhancement with mixture-of-deep-experts with clean clustering pre-training

In this study we present a mixture of deep experts (MoDE) neural-network...
research
11/14/2019

Speaker independence of neural vocoders and their effect on parametric resynthesis speech enhancement

Traditional speech enhancement systems produce speech with compromised q...
research
06/16/2022

Adversarial Privacy Protection on Speech Enhancement

Speech is easily leaked imperceptibly, such as being recorded by mobile ...
research
12/04/2017

Precision Scaling of Neural Networks for Efficient Audio Processing

While deep neural networks have shown powerful performance in many audio...
research
02/08/2023

Masking Kernel for Learning Energy-Efficient Speech Representation

Modern smartphones are equipped with powerful audio hardware and process...
research
01/07/2020

Resource-Efficient Neural Networks for Embedded Systems

While machine learning is traditionally a resource intensive task, embed...
research
12/05/2018

Efficient and Robust Machine Learning for Real-World Systems

While machine learning is traditionally a resource intensive task, embed...
