Frequency domain variants of velvet noise and their application to speech processing and synthesis: with appendices

06/18/2018
by   Hideki Kawahara, et al.
Meijo University
University of Yamanashi
Health Sciences University of Hokkaido
Wakayama University
0

We propose a new excitation source signal for VOCODERs and an all-pass impulse response for post-processing of synthetic sounds and pre-processing of natural sounds for data-augmentation. The proposed signals are variants of velvet noise, which is a sparse discrete signal consisting of a few non-zero (1 or -1) elements and sounds smoother than Gaussian white noise. One of the proposed variants, FVN (Frequency domain Velvet Noise) applies the procedure to generate a velvet noise on the cyclic frequency domain of DFT (Discrete Fourier Transform). Then, by smoothing the generated signal to design the phase of an all-pass filter followed by inverse Fourier transform yields the proposed FVN. Temporally variable frequency weighted mixing of FVN generated by frozen and shuffled random number provides a unified excitation signal which can span from random noise to a repetitive pulse train. The other variant, which is an all-pass impulse response, significantly reduces "buzzy" impression of VOCODER output by filtering. Finally, we will discuss applications of the proposed signal for watermarking and psychoacoustic research.

READ FULL TEXT

page 7

page 8

page 9

page 11

12/11/2017

Robust Sparse Fourier Transform Based on The Fourier Projection-Slice Theorem

The state-of-the-art automotive radars employ multidimensional discrete ...
02/01/2019

Multi-layered Cepstrum for Instantaneous Frequency Estimation

We propose the multi-layered cepstrum (MLC) method to estimate multiple ...
02/08/2022

A covariant, discrete time-frequency representation tailored for zero-based signal detection

Recent work in time-frequency analysis proposed to switch the focus from...
08/24/2021

Efficient computation of the zeros of the Bargmann transform under additive white noise

We study the computation of the zero set of the Bargmann transform of a ...
06/05/2018

Sampling and Super-resolution of Sparse Signals Beyond the Fourier Domain

Recovering a sparse signal from its low-pass projections in the Fourier ...
06/14/2023

FTIO: Detecting I/O Periodicity Using Frequency Techniques

Characterizing the temporal I/O behavior of an HPC application is a chal...

Please sign up or login with your details

Forgot password? Click here to reset