Monaural source enhancement maximizing source-to-distortion ratio via automatic differentiation

06/15/2018
by   Hiroaki Nakajima, et al.

Recently, deep neural networks (DNNs) have made a breakthrough in monaural source enhancement. Trained on a large amount of data, a DNN estimates a mapping from mixed signals to clean signals. Training uses an objective function that numerically expresses the quality of the mapping learned by the DNN. In conventional methods, the L1 norm, the L2 norm, and the Itakura-Saito divergence are often used as objective functions. Recently, an objective function based on short-time objective intelligibility (STOI) has also been proposed. However, these functions only indicate the similarity between the clean signal and the signal estimated by the DNN. In other words, they do not show the quality of noise reduction or source enhancement. Motivated by this fact, this paper adopts the signal-to-distortion ratio (SDR) as the objective function. Since SDR virtually shows the signal-to-noise ratio (SNR), maximizing SDR solves the above problem. The experimental results revealed that the proposed method achieved better performance than the conventional methods.
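
To make the idea of an SDR-based objective concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not give the exact SDR formula, so the sketch assumes the simplified energy-ratio form 10·log10(‖s‖² / ‖s − ŝ‖²), and the function names `sdr_db` and `neg_sdr_loss` are hypothetical. In an actual training setup the same expression would be written in a framework with automatic differentiation, so that its gradient with respect to the DNN parameters is obtained automatically.

```python
import math


def sdr_db(clean, estimate, eps=1e-12):
    """Simplified SDR in dB: 10*log10(||s||^2 / ||s - s_hat||^2).

    Assumption: this energy-ratio form is used for illustration only;
    the paper may use a different (e.g. BSS Eval style) definition.
    """
    if len(clean) != len(estimate):
        raise ValueError("signals must have the same length")
    signal_energy = sum(s * s for s in clean)
    distortion_energy = sum((s - e) ** 2 for s, e in zip(clean, estimate))
    # eps keeps the ratio finite for silent or perfectly estimated signals.
    return 10.0 * math.log10((signal_energy + eps) / (distortion_energy + eps))


def neg_sdr_loss(clean, estimate):
    """Loss to minimize: maximizing SDR equals minimizing its negative."""
    return -sdr_db(clean, estimate)


# Toy signals: a sine wave and two scaled copies standing in for DNN outputs.
clean = [math.sin(2 * math.pi * 5 * n / 100) for n in range(100)]
rough = [0.5 * s for s in clean]   # larger distortion
close = [0.9 * s for s in clean]   # smaller distortion

print(f"SDR (rough estimate): {sdr_db(clean, rough):.2f} dB")
print(f"SDR (close estimate): {sdr_db(clean, close):.2f} dB")
```

The estimate with less distortion yields a higher SDR and therefore a lower loss, which is the direction a gradient-based optimizer would push the network.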

research
02/14/2020

Consistency-aware multi-channel speech enhancement using deep neural networks

This paper proposes a deep neural network (DNN)-based multi-channel spee...
research
04/08/2022

Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners

An accurate objective speech intelligibility prediction algorithms is of...
research
04/27/2017

Complex spectrogram enhancement by convolutional neural network with multi-metrics learning

This paper aims to address two issues existing in the current speech enh...
research
04/15/2020

Explaining Regression Based Neural Network Model

Several methods have been proposed to explain Deep Neural Network (DNN)....
research
10/22/2018

DNN-based Source Enhancement to Increase Objective Sound Quality Assessment Score

We propose a training method for deep neural network (DNN)-based source ...
research
02/03/2022

Removing Distortion Effects in Music Using Deep Neural Networks

Audio effects are an essential element in the context of music productio...
research
09/29/2019

Model-aided Deep Neural Network for Source Number Detection

Source number detection is a critical problem in array signal processing...
