A Generative Product-of-Filters Model of Audio

12/20/2013
by   Dawen Liang, et al.
0

We propose the product-of-filters (PoF) model, a generative model that decomposes audio spectra as sparse linear combinations of "filters" in the log-spectral domain. PoF makes similar assumptions to those used in the classic homomorphic filtering approach to signal processing, but replaces hand-designed decompositions built of basic signal processing operations with a learned decomposition based on statistical inference. This paper formulates the PoF model and derives a mean-field method for posterior inference and a variational EM algorithm to estimate the model's free parameters. We demonstrate PoF's potential for audio processing on a bandwidth expansion task, and show that PoF can serve as an effective unsupervised feature extractor for a speaker identification task.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/30/2009

Writer Identification Using Inexpensive Signal Processing Techniques

We propose to use novel and classical audio and text signal-processing a...
research
06/09/2022

Discriminative and Generative Learning for Linear Estimation of Random Signals [Lecture Notes]

Inference tasks in signal processing are often characterized by the avai...
research
09/30/2021

An investigation of pre-upsampling generative modelling and Generative Adversarial Networks in audio super resolution

There have been several successful deep learning models that perform aud...
research
07/15/2020

A survey and an extensive evaluation of popular audio declipping methods

Dynamic range limitations in signal processing often lead to clipping, o...
research
04/27/2021

Visualization of Linear Operations in the Spherical Harmonics Domain

Linear operations on coefficients in the spherical harmonics (SH) transf...
research
10/12/2015

VB calibration to improve the interface between phone recognizer and i-vector extractor

The EM training algorithm of the classical i-vector extractor is often i...
research
05/26/2020

A biomimetic basis for the perception of natural sounds

Arrays of subwavelength resonators can mimic the biomechanical propertie...

Please sign up or login with your details

Forgot password? Click here to reset