Robust Downbeat Tracking Using an Ensemble of Convolutional Networks

05/26/2016
by   S. Durand, et al.
0

In this paper, we present a novel state of the art system for automatic downbeat tracking from music signals. The audio signal is first segmented in frames which are synchronized at the tatum level of the music. We then extract different kind of features based on harmony, melody, rhythm and bass content to feed convolutional neural networks that are adapted to take advantage of each feature characteristics. This ensemble of neural networks is combined to obtain one downbeat likelihood per tatum. The downbeat sequence is finally decoded with a flexible and efficient temporal model which takes advantage of the metrical continuity of a song. We then perform an evaluation of our system on a large base of 9 datasets, compare its performance to 4 other published algorithms and obtain a significant increase of 16.8 percent points compared to the second best system, for altogether a moderate cost in test and training. The influence of each step of the method is studied to show its strengths and shortcomings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/14/2016

Convolutional Recurrent Neural Networks for Music Classification

We introduce a convolutional recurrent neural network (CRNN) for music t...
research
10/26/2019

A holistic approach to polyphonic music transcription with neural networks

We present a framework based on neural networks to extract music scores ...
research
06/21/2017

Multi-Level and Multi-Scale Feature Aggregation Using Sample-level Deep Convolutional Neural Networks for Music Classification

Music tag words that describe music audio by text have different levels ...
research
05/01/2018

Randomly weighted CNNs for (music) audio classification

The computer vision literature shows that randomly weighted neural netwo...
research
02/18/2018

Music Genre Classification using Masked Conditional Neural Networks

The ConditionaL Neural Networks (CLNN) and the Masked ConditionaL Neural...
research
04/07/2017

OBTAIN: Real-Time Beat Tracking in Audio Signals

In this paper, we design a system in order to perform the real-time beat...
research
08/17/2022

Extract fundamental frequency based on CNN combined with PYIN

This paper refers to the extraction of multiple fundamental frequencies ...

Please sign up or login with your details

Forgot password? Click here to reset