Controlling the Perceived Sound Quality for Dialogue Enhancement with Deep Learning

07/22/2021
by   Christian Uhle, et al.
0

Speech enhancement attenuates interfering sounds in speech signals but may introduce artifacts that perceivably deteriorate the output signal. We propose a method for controlling the trade-off between the attenuation of the interfering background signal and the loss of sound quality. A deep neural network estimates the attenuation of the separated background signal such that the sound quality, quantified using the Artifact-related Perceptual Score, meets an adjustable target. Subjective evaluations indicate that consistent sound quality is obtained across various input signals. Our experiments show that the proposed method is able to control the trade-off with an accuracy that is adequate for real-world dialogue enhancement applications.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/02/2021

Reduction of Subjective Listening Effort for TV Broadcast Signals with Recurrent Neural Networks

Listening to the audio of TV broadcast signals can be challenging for he...
research
02/14/2020

Stable Training of DNN for Speech Enhancement based on Perceptually-Motivated Black-Box Cost Function

Improving subjective sound quality of enhanced signals is one of the mos...
research
06/16/2022

To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets

In real life, room effect, also known as room reverberation, and the pre...
research
12/10/2018

An individualized super Gaussian single microphone Speech Enhancement for hearing aid users with smartphone as an assistive device

In this letter, we derive a new super Gaussian Joint Maximum a Posterior...
research
07/21/2021

Controlling the Remixing of Separated Dialogue with a Non-Intrusive Quality Estimate

Remixing separated audio sources trades off interferer attenuation again...
research
06/07/2022

Universal Speech Enhancement with Score-based Diffusion

Removing background noise from speech audio has been the subject of cons...
research
09/19/2019

Robot Sound Interpretation: Combining Sight and Sound in Learning-Based Control

We explore the interpretation of sound for robot decision-making, inspir...

Please sign up or login with your details

Forgot password? Click here to reset