wav2shape: Hearing the Shape of a Drum Machine

07/20/2020
by   Han Han, et al.
0

Disentangling and recovering physical attributes, such as shape and material, from a few waveform examples is a challenging inverse problem in audio signal processing, with numerous applications in musical acoustics as well as structural engineering. We propose to address this problem via a combination of time–frequency analysis and supervised machine learning. We start by synthesizing a dataset of sounds using the functional transformation method. Then, we represent each percussive sound in terms of its time-invariant scattering transform coefficients and formulate the parametric estimation of the resonator as multidimensional regression with a deep convolutional neural network. We interpolate scattering coefficients over the surface of the drum as a surrogate for potentially missing data, and study the response of the neural network to interpolated samples. Lastly, we resynthesize drum sounds from scattering coefficients, therefore paving the way towards a deep generative model of drum sounds whose latent variables are physically interpretable.

READ FULL TEXT

page 1

page 5

page 10

research
09/01/2015

Transformée en scattering sur la spirale temps-chroma-octave

We introduce a scattering representation for the analysis and classifica...
research
07/24/2018

Classification with Joint Time-Frequency Scattering

In time series classification, signals are typically mapped into some in...
research
03/19/2022

Modelling nonlinear dependencies in the latent space of inverse scattering

The problem of inverse scattering proposed by Angles and Mallat in 2018,...
research
01/24/2023

Mesostructures: Beyond Spectrogram Loss in Differentiable Time-Frequency Analysis

Computer musicians refer to mesostructures as the intermediate levels of...
research
05/01/2018

Solid Harmonic Wavelet Scattering for Predictions of Molecule Properties

We present a machine learning algorithm for the prediction of molecule p...
research
04/18/2022

Differentiable Time-Frequency Scattering in Kymatio

Joint time-frequency scattering (JTFS) is a convolutional operator in th...
research
07/24/2018

A Hybrid of Deep Audio Feature and i-vector for Artist Recognition

Artist recognition is a task of modeling the artist's musical style. Thi...

Please sign up or login with your details

Forgot password? Click here to reset