Kapre: On-GPU Audio Preprocessing Layers for a Quick Implementation of Deep Neural Network Models with Keras

06/19/2017
by   Keunwoo Choi, et al.
0

We introduce Kapre, Keras layers for audio and music signal preprocessing. Music research using deep neural networks requires a heavy and tedious preprocessing stage, for which audio processing parameters are often ignored in parameter optimisation. To solve this problem, Kapre implements time-frequency conversions, normalisation, and data augmentation as Keras layers. We report simple benchmark results, showing real-time on-GPU preprocessing adds a reasonable amount of computation.

READ FULL TEXT

page 1

page 2

page 3

research
09/06/2017

A Comparison on Audio Signal Preprocessing Methods for Deep Neural Networks on Music Tagging

Deep neural networks (DNN) have been successfully applied for music clas...
research
06/15/2019

Audio-Based Music Classification with DenseNet And Data Augmentation

In recent years, deep learning technique has received intense attention ...
research
05/15/2021

1D CNN Architectures for Music Genre Classification

This paper proposes a 1D residual convolutional neural network (CNN) arc...
research
12/17/2018

Instrument-Independent Dastgah Recognition of Iranian Classical Music Using AzarNet

In this paper, AzarNet, a deep neural network (DNN), is proposed to reco...
research
01/15/2019

Classical Music Generation in Distinct Dastgahs with AlimNet ACGAN

In this paper AlimNet (With respect to great musician, Alim Qasimov) an ...
research
01/31/2023

Automated Time-frequency Domain Audio Crossfades using Graph Cuts

The problem of transitioning smoothly from one audio clip to another ari...
research
01/25/2020

The impact of Audio input representations on neural network based music transcription

This paper thoroughly analyses the effect of different input representat...

Please sign up or login with your details

Forgot password? Click here to reset