Frequency domain variants of velvet noise and their application to speech processing and synthesis: with appendices

by Hideki Kawahara, et al.
Meijo University
University of Yamanashi
Health Sciences University of Hokkaido
Wakayama University

We propose a new excitation source signal for VOCODERs and an all-pass impulse response for post-processing synthetic sounds and pre-processing natural sounds for data augmentation. The proposed signals are variants of velvet noise, a sparse discrete signal that consists of a few non-zero (1 or -1) elements and sounds smoother than Gaussian white noise. One of the proposed variants, FVN (Frequency domain Velvet Noise), applies the velvet-noise generation procedure on the cyclic frequency domain of the DFT (Discrete Fourier Transform). Smoothing the generated signal to design the phase of an all-pass filter and then applying the inverse Fourier transform yields the proposed FVN. Temporally variable, frequency-weighted mixing of FVN signals generated from frozen and shuffled random numbers provides a unified excitation signal that can span from random noise to a repetitive pulse train. The other variant, an all-pass impulse response, significantly reduces the "buzzy" impression of VOCODER output when used as a filter. Finally, we discuss applications of the proposed signals to watermarking and psychoacoustic research.
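The construction outlined in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The function names, the Hann smoothing window, the peak-phase scaling, and all parameter values are assumptions chosen for illustration; the abstract does not specify them. The time-domain velvet noise follows the commonly used one-pulse-per-segment scheme with a random position and a random sign.

```python
import numpy as np

def velvet_noise(length, avg_interval, rng):
    """Sparse sequence with one +1/-1 pulse per segment of avg_interval samples."""
    n_pulses = length // avg_interval
    segment_start = np.arange(n_pulses) * avg_interval
    # Random pulse position inside each segment (0 .. avg_interval - 1).
    offsets = np.round(rng.random(n_pulses) * (avg_interval - 1)).astype(int)
    signs = rng.choice([-1.0, 1.0], size=n_pulses)
    out = np.zeros(length)
    out[segment_start + offsets] = signs
    return out

def fvn_like_allpass(fft_size, avg_interval_bins, smooth_bins, max_phase, rng):
    """All-pass impulse response whose phase is smoothed velvet noise
    laid out along the frequency axis (illustrative assumptions only)."""
    n_bins = fft_size // 2 + 1                      # DC .. Nyquist
    sparse = velvet_noise(n_bins, avg_interval_bins, rng)
    # Assumed smoother: a Hann window convolved along frequency.
    phase = np.convolve(sparse, np.hanning(smooth_bins), mode="same")
    peak = np.max(np.abs(phase))
    if peak > 0:
        phase *= max_phase / peak                   # assumed phase scaling
    # Zero phase at DC and Nyquist keeps the spectrum exactly unit-magnitude
    # for a real-valued impulse response.
    phase[0] = 0.0
    phase[-1] = 0.0
    spectrum = np.exp(1j * phase)                   # |spectrum| == 1 everywhere
    return np.fft.irfft(spectrum, n=fft_size)

rng = np.random.default_rng(0)
v = velvet_noise(1000, 20, rng)
h = fvn_like_allpass(1024, 8, 9, np.pi / 2, rng)
```

Because the magnitude response is flat, filtering with `h` changes only the phase of the input, which is the property the abstract relies on for the all-pass variant.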




