Optimizing Short-Time Fourier Transform Parameters via Gradient Descent

10/28/2020
by   An Zhao, et al.
0

The Short-Time Fourier Transform (STFT) has been a staple of signal processing, often being the first step for many audio tasks. A very familiar process when using the STFT is the search for the best STFT parameters, as they often have significant side effects if chosen poorly. These parameters are often defined in terms of an integer number of samples, which makes their optimization non-trivial. In this paper we show an approach that allows us to obtain a gradient for STFT parameters with respect to arbitrary cost functions, and thus enable the ability to employ gradient descent optimization of quantities like the STFT window length, or the STFT hop size. We do so for parameter values that stay constant throughout an input, but also for cases where these parameters have to dynamically change over time to accommodate varying signal characteristics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/26/2023

Differentiable adaptive short-time Fourier transform with respect to the window length

This paper presents a gradient-based method for on-the-fly optimization ...
research
07/26/2023

Differentiable short-time Fourier transform with respect to the hop length

In this paper, we propose a differentiable version of the short-time Fou...
research
08/23/2022

A differentiable short-time Fourier transform with respect to the window length

In this paper, we revisit the use of spectrograms in neural networks, by...
research
10/28/2020

Frequency-Undersampled Short-Time Fourier Transform

The short-time Fourier transform (STFT) usually computes the same number...
research
04/29/2021

Simulating the DFT Algorithm for Audio Processing

Since the evolution of digital computers, the storage of data has always...
research
12/27/2019

nnAudio: An on-the-fly GPU Audio to Spectrogram Conversion Toolbox Using 1D Convolution Neural Networks

Converting time domain waveforms to frequency domain spectrograms is typ...
research
10/01/2020

Phase retrieval with Bregman divergences and application to audio signal recovery

Phase retrieval (PR) aims to recover a signal from the magnitudes of a s...

Please sign up or login with your details

Forgot password? Click here to reset