Musical Tempo and Key Estimation using Convolutional Neural Networks with Directional Filters

03/26/2019
by   Hendrik Schreiber, et al.
0

In this article we explore how the different semantics of spectrograms' time and frequency axes can be exploited for musical tempo and key estimation using Convolutional Neural Networks (CNN). By addressing both tasks with the same network architectures ranging from shallow, domain-specific approaches to deep variants with directional filters, we show that axis-aligned architectures perform similarly well as common VGG-style networks developed for computer vision, while being less vulnerable to confounding factors and requiring fewer model parameters.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/31/2018

In-depth Question classification using Convolutional Neural Networks

Convolutional neural networks for computer vision are fairly intuitive. ...
research
11/11/2018

Fashion and Apparel Classification using Convolutional Neural Networks

We present an empirical study of applying deep Convolutional Neural Netw...
research
05/03/2019

Effectiveness of Self Normalizing Neural Networks for Text Classification

Self Normalizing Neural Networks(SNN) proposed on Feed Forward Neural Ne...
research
05/21/2016

Deep convolutional networks on the pitch spiral for musical instrument recognition

Musical performance combines a wide range of pitches, nuances, and expre...
research
07/13/2016

Do semantic parts emerge in Convolutional Neural Networks?

Semantic object parts can be useful for several visual recognition tasks...
research
02/10/2020

Modeling Musical Onset Probabilities via Neural Distribution Learning

Musical onset detection can be formulated as a time-to-event (TTE) or ti...
research
06/14/2015

Compressing Convolutional Neural Networks

Convolutional neural networks (CNN) are increasingly used in many areas ...

Please sign up or login with your details

Forgot password? Click here to reset