SANGEET: A XML based Open Dataset for Research in Hindustani Sangeet

06/07/2023
by   Chandan Misra, et al.
0

It is very important to access a rich music dataset that is useful in a wide variety of applications. Currently, available datasets are mostly focused on storing vocal or instrumental recording data and ignoring the requirement of its visual representation and retrieval. This paper attempts to build an XML-based public dataset, called SANGEET, that stores comprehensive information of Hindustani Sangeet (North Indian Classical Music) compositions written by famous musicologist Pt. Vishnu Narayan Bhatkhande. SANGEET preserves all the required information of any given composition including metadata, structural, notational, rhythmic, and melodic information in a standardized way for easy and efficient storage and extraction of musical information. The dataset is intended to provide the ground truth information for music information research tasks, thereby supporting several data-driven analysis from a machine learning perspective. We present the usefulness of the dataset by demonstrating its application on music information retrieval using XQuery, visualization through Omenad rendering system. Finally, we propose approaches to transform the dataset for performing statistical and machine learning tasks for a better understanding of Hindustani Sangeet. The dataset can be found at https://github.com/cmisra/Sangeet.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/17/2023

jazznet: A Dataset of Fundamental Piano Patterns for Music Audio Machine Learning Research

This paper introduces the jazznet Dataset, a dataset of fundamental jazz...
research
05/12/2021

A Statistical Model for Melody Reduction

A commonly-cited reason for the poor performance of automatic chord esti...
research
10/06/2022

AnimeTAB: A new guitar tablature dataset of anime and game music

While guitar tablature has become a popular topic in MIR research, there...
research
12/31/2018

The Music Streaming Sessions Dataset

At the core of many important machine learning problems faced by online ...
research
11/02/2017

Identification of potential Music Information Retrieval technologies for computer-aided jingju singing training

Music Information Retrieval (MIR) technologies have been proven useful i...
research
07/29/2020

dMelodies: A Music Dataset for Disentanglement Learning

Representation learning focused on disentangling the underlying factors ...
research
08/07/2017

STARDATA: A StarCraft AI Research Dataset

We release a dataset of 65646 StarCraft replays that contains 1535 milli...

Please sign up or login with your details

Forgot password? Click here to reset