Gamma Boltzmann Machine for Simultaneously Modeling Linear- and Log-amplitude Spectra

06/24/2020
by   Toru Nakashika, et al.
0

In audio applications, one of the most important representations of audio signals is the amplitude spectrogram. It is utilized in many machine-learning-based information processing methods including the ones using the restricted Boltzmann machines (RBM). However, the ordinary Gaussian-Bernoulli RBM (the most popular RBM among its variations) cannot directly handle amplitude spectra because the Gaussian distribution is a symmetric model allowing negative values which never appear in the amplitude. In this paper, after proposing a general gamma Boltzmann machine, we propose a practical model called the gamma-Bernoulli RBM that simultaneously handles both linear- and log-amplitude spectrograms. Its conditional distribution of the observable data is given by the gamma distribution, and thus the proposed RBM can naturally handle the data represented by positive numbers as the amplitude spectra. It can also treat amplitude in the logarithmic scale which is important for audio signals from the perceptual point of view. The advantage of the proposed model compared to the ordinary Gaussian-Bernoulli RBM was confirmed by PESQ and MSE in the experiment of representing the amplitude spectrograms of speech signals.

READ FULL TEXT

page 1

page 5

research
05/04/2020

Complex Amplitude-Phase Boltzmann Machines

We extend the framework of Boltzmann machines to a network of complex-va...
research
03/27/2018

Complex-Valued Restricted Boltzmann Machine for Direct Speech Parameterization from Complex Spectra

This paper describes a novel energy-based probabilistic distribution tha...
research
07/23/2019

Log Complex Color for Visual Pattern Recognition of Total Sound

While traditional audio visualization methods depict amplitude intensiti...
research
03/13/2019

Low-rankness of Complex-valued Spectrogram and Its Application to Phase-aware Audio Processing

Low-rankness of amplitude spectrograms has been effectively utilized in ...
research
06/15/2020

A Generalized Gaussian Extension to the Rician Distribution for SAR Image Modeling

In this paper, we present a novel statistical model, the generalized-Gau...
research
07/14/2018

Gamma Spaces and Information

We investigate the role of Segal's Gamma-spaces in the context of classi...
research
05/08/2021

Comparative analysis of the original and amplitude permutations

We compare the two basic ordinal patterns, i.e., the original and amplit...

Please sign up or login with your details

Forgot password? Click here to reset