Learning Models for Query by Vocal Percussion: A Comparative Study

10/18/2021
by   Alejandro Delgado, et al.
0

The imitation of percussive sounds via the human voice is a natural and effective tool for communicating rhythmic ideas on the fly. Thus, the automatic retrieval of drum sounds using vocal percussion can help artists prototype drum patterns in a comfortable and quick way, smoothing the creative workflow as a result. Here we explore different strategies to perform this type of query, making use of both traditional machine learning algorithms and recent deep learning techniques. The main hyperparameters from the models involved are carefully selected by feeding performance metrics to a grid search algorithm. We also look into several audio data augmentation techniques, which can potentially regularise deep learning models and improve generalisation. We compare the final performances in terms of effectiveness (classification accuracy), efficiency (computational speed), stability (performance consistency), and interpretability (decision patterns), and discuss the relevance of these results when it comes to the design of successful query-by-vocal-percussion systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/19/2020

On tuning deep learning models: a data mining perspective

Deep learning algorithms vary depending on the underlying connection mec...
research
10/05/2020

A Comparative Study of Existing and New Deep Learning Methods for Detecting Knee Injuries using the MRNet Dataset

This work presents a comparative study of existing and new techniques to...
research
06/26/2022

Data Augmentation for Dementia Detection in Spoken Language

Dementia is a growing problem as our society ages, and detection methods...
research
04/05/2022

Mixing Signals: Data Augmentation Approach for Deep Learning Based Modulation Recognition

With the rapid development of deep learning, automatic modulation recogn...
research
12/18/2018

DeepLens: Towards a Visual Data Management System

Advances in deep learning have greatly widened the scope of automatic co...
research
11/01/2017

Data, Depth, and Design: Learning Reliable Models for Melanoma Screening

State of the art on melanoma screening evolved rapidly in the last two y...
research
01/15/2021

Motion-Based Handwriting Recognition

We attempt to overcome the restriction of requiring a writing surface fo...

Please sign up or login with your details

Forgot password? Click here to reset