Symbolic Music Data Version 1.0

06/08/2016
by   Christian Walder, et al.
0

In this document, we introduce a new dataset designed for training machine learning models of symbolic music data. Five datasets are provided, one of which is from a newly collected corpus of 20K midi files. We describe our preprocessing and cleaning pipeline, which includes the exclusion of a number of files based on scores from a previously developed probabilistic machine learning model. We also define training, testing and validation splits for the new dataset, based on a clustering scheme which we also describe. Some simple histograms are included.

READ FULL TEXT
research
12/31/2021

InverseMV: Composing Piano Scores with a Convolutional Video-Music Transformer

Many social media users prefer consuming content in the form of videos r...
research
03/09/2021

Learning to Generate Music With Sentiment

Deep Learning models have shown very promising results in automatically ...
research
06/23/2023

DISCO-10M: A Large-Scale Music Dataset

Music datasets play a crucial role in advancing research in machine lear...
research
09/09/2020

A dataset and classification model for Malay, Hindi, Tamil and Chinese music

In this paper we present a new dataset, with musical excepts from the th...
research
02/20/2022

towards automatic transcription of polyphonic electric guitar music:a new dataset and a multi-loss transformer model

In this paper, we propose a new dataset named EGDB, that con-tains trans...
research
07/29/2020

dMelodies: A Music Dataset for Disentanglement Learning

Representation learning focused on disentangling the underlying factors ...
research
05/30/2019

A Music Classification Model based on Metric Learning and Feature Extraction from MP3 Audio Files

The development of models for learning music similarity and feature extr...

Please sign up or login with your details

Forgot password? Click here to reset