Image to Image Translation based on Convolutional Neural Network Approach for Speech Declipping

10/26/2019
by Hamidreza Baradaran Kashani, et al.

Clipping, a common nonlinear distortion, often occurs due to the limited dynamic range of audio recorders. It degrades speech quality and intelligibility and adversely affects the performance of speech and speaker recognition systems. In this paper, we focus on the enhancement of clipped speech using a fully convolutional neural network, U-Net. Motivated by the idea of image-to-image translation, we propose a declipping approach, the U-Net declipper, in which the magnitude spectrum images of clipped signals are translated to the corresponding images of clean ones. The experimental results show that the proposed approach outperforms other declipping methods in terms of both quality and intelligibility measures, especially in severe clipping cases. Moreover, the superior performance of the U-Net declipper over well-known declipping methods is verified under additive Gaussian noise conditions.
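
To make the setup in the abstract concrete, the following is a minimal sketch of how a clipped signal and its magnitude spectrum image might be produced as an input/target pair for such a translation network. It is not the authors' pipeline: the symmetric hard-clipping model, the synthetic 440 Hz tone standing in for speech, the Hann window, and the `frame_len`, `hop`, and `threshold` values are all assumptions chosen for illustration, and the helper names `hard_clip` and `magnitude_spectrogram` are hypothetical. The U-Net itself is not shown.

```python
import numpy as np

def hard_clip(x, threshold):
    """Symmetric hard clipping (assumed model): flatten samples beyond +/- threshold."""
    return np.clip(x, -threshold, threshold)

def magnitude_spectrogram(x, frame_len=512, hop=128):
    """Magnitude of a Hann-windowed STFT, shaped (frame_len // 2 + 1, n_frames).

    frame_len and hop are illustrative values, not taken from the paper.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack(
        [x[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    # rfft keeps only the non-negative frequencies of each real-valued frame
    return np.abs(np.fft.rfft(frames, axis=1)).T

# Synthetic stand-in for a speech signal: one second of a 440 Hz tone at 16 kHz
sr = 16000
t = np.arange(sr) / sr
clean = 0.9 * np.sin(2 * np.pi * 440 * t)

# Severe clipping: the threshold is one third of the peak amplitude
clipped = hard_clip(clean, threshold=0.3)

# Network input (clipped) and training target (clean) as 2-D "images"
clipped_img = magnitude_spectrogram(clipped)
clean_img = magnitude_spectrogram(clean)
```

With these settings both arrays have shape `(257, 122)`. The clipped image shows extra energy at odd harmonics of the tone that is absent from the clean image; a declipper trained on such pairs would aim to remove that distortion.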


11/05/2019

Speech Enhancement via Deep Spectrum Image Translation Network

Quality and intelligibility of speech signals are degraded under additiv...
10/27/2021

Separating Content and Style for Unsupervised Image-to-Image Translation

Unsupervised image-to-image translation aims to learn the mapping betwee...
05/16/2022

VQBB: Image-to-image Translation with Vector Quantized Brownian Bridge

Image-to-image translation is an important and challenging problem in co...
03/15/2022

Multi-Curve Translator for Real-Time High-Resolution Image-to-Image Translation

The dominant image-to-image translation methods are based on fully convo...
04/07/2020

Direct Speech-to-image Translation

Direct speech-to-image translation without text is an interesting and us...
02/20/2020

Photorealistic Lip Sync with Adversarial Temporal Convolutional Networks

Lip sync has emerged as a promising technique to generate mouth movement...
07/17/2020

SkipConvNet: Skip Convolutional Neural Network for Speech Dereverberation using Optimally Smoothed Spectral Mapping

The reliability of using fully convolutional networks (FCNs) has been su...