WaveBeat: End-to-end beat and downbeat tracking in the time domain

10/04/2021
by   Christian J. Steinmetz, et al.
0

Deep learning approaches for beat and downbeat tracking have brought advancements. However, these approaches continue to rely on hand-crafted, subsampled spectral features as input, restricting the information available to the model. In this work, we propose WaveBeat, an end-to-end approach for joint beat and downbeat tracking operating directly on waveforms. This method forgoes engineered spectral features, and instead, produces beat and downbeat predictions directly from the waveform, the first of its kind for this task. Our model utilizes temporal convolutional networks (TCNs) operating on waveforms that achieve a very large receptive field (≥ 30 s) at audio sample rates in a memory efficient manner by employing rapidly growing dilation factors with fewer layers. With a straightforward data augmentation strategy, our method outperforms previous state-of-the-art methods on some datasets, while producing comparable results on others, demonstrating the potential for time domain approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/07/2013

End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks

Most phoneme recognition state-of-the-art systems rely on a classical ne...
research
02/28/2017

Deep Image Harmonization

Compositing is one of the most common operations in photo editing. To ge...
research
07/23/2015

Deep Fishing: Gradient Features from Deep Nets

Convolutional Networks (ConvNets) have recently improved image recogniti...
research
04/18/2019

SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

We present SpecAugment, a simple data augmentation method for speech rec...
research
10/05/2022

Deep learning for ECoG brain-computer interface: end-to-end vs. hand-crafted features

In brain signal processing, deep learning (DL) models have become common...
research
03/08/2023

DNBP: Differentiable Nonparametric Belief Propagation

We present a differentiable approach to learn the probabilistic factors ...
research
08/02/2020

Deep Visual Odometry with Adaptive Memory

We propose a novel deep visual odometry (VO) method that considers globa...

Please sign up or login with your details

Forgot password? Click here to reset