Deep Transform: Time-Domain Audio Error Correction via Probabilistic Re-Synthesis

03/19/2015
by   Andrew J. R. Simpson, et al.
0

In the process of recording, storage and transmission of time-domain audio signals, errors may be introduced that are difficult to correct in an unsupervised way. Here, we train a convolutional deep neural network to re-synthesize input time-domain speech signals at its output layer. We then use this abstract transformation, which we call a deep transform (DT), to perform probabilistic re-synthesis on further speech (of the same speaker) which has been degraded. Using the convolutive DT, we demonstrate the recovery of speech audio that has been subject to extreme degradation. This approach may be useful for correction of errors in communications devices.

READ FULL TEXT
research
09/15/2023

DiaCorrect: Error Correction Back-end For Speaker Diarization

In this work, we propose an error correction framework, named DiaCorrect...
research
07/31/2023

Audio-visual video-to-speech synthesis with synthesized input audio

Video-to-speech synthesis involves reconstructing the speech signal of a...
research
08/26/2020

DeepVOX: Discovering Features from Raw Audio for Speaker Recognition in Degraded Audio Signals

Automatic speaker recognition algorithms typically use pre-defined filte...
research
03/09/2023

Towards Robust Image-in-Audio Deep Steganography

The field of steganography has experienced a surge of interest due to th...
research
03/20/2015

Deep Transform: Cocktail Party Source Separation via Probabilistic Re-Synthesis

In cocktail party listening scenarios, the human brain is able to separa...
research
02/15/2022

Phase Vocoder Done Right

The phase vocoder (PV) is a widely spread technique for processing audio...
research
05/22/2003

Back-propagation of accuracy

In this paper we solve the problem: how to determine maximal allowable e...

Please sign up or login with your details

Forgot password? Click here to reset