The Chamber Ensemble Generator: Limitless High-Quality MIR Data via Generative Modeling

09/28/2022
by Yusong Wu, et al.

Data is the lifeblood of modern machine learning systems, including those in Music Information Retrieval (MIR). However, MIR has long been mired in small datasets and unreliable labels. In this work, we propose to break this bottleneck using generative modeling. By pipelining a generative model of notes (Coconet trained on Bach Chorales) with a structured synthesis model of chamber ensembles (MIDI-DDSP trained on URMP), we demonstrate a system capable of producing unlimited amounts of realistic chorale music with rich annotations, including mixes, stems, MIDI, note-level performance attributes (staccato, vibrato, etc.), and even fine-grained synthesis parameters (pitch, amplitude, etc.). We call this system the Chamber Ensemble Generator (CEG), and use it to generate a large dataset of chorales from four different chamber ensembles (CocoChorales). We demonstrate that data generated using our approach improves state-of-the-art models for music transcription and source separation, and we release both the system and the dataset as an open-source foundation for future work in the MIR community.
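The two-stage structure described in the abstract (a note model feeding a synthesis model, with every intermediate result kept as a label) can be pictured with a toy sketch. This is not the CEG code and it does not call Coconet or MIDI-DDSP: `generate_notes` and `synthesize_part` are hypothetical stand-ins (random pitches and decaying sine tones), and the sample rate, pitch ranges, and note duration are arbitrary assumptions chosen to keep the example small.

```python
import math
import random

SAMPLE_RATE = 8000  # assumed low rate so the toy example runs quickly


def generate_notes(num_parts=4, num_steps=8, seed=0):
    """Hypothetical stand-in for the note model: random MIDI pitches per part."""
    rng = random.Random(seed)
    centers = [72, 65, 58, 48]  # assumed rough soprano/alto/tenor/bass centers
    return [
        [centers[p % 4] + rng.randint(-5, 5) for _ in range(num_steps)]
        for p in range(num_parts)
    ]


def synthesize_part(pitches, note_dur=0.25):
    """Hypothetical stand-in for the synthesis model.

    Renders each note as a decaying sine tone and keeps the per-sample
    pitch (f0) and amplitude curves that produced the audio.
    """
    n = int(SAMPLE_RATE * note_dur)
    audio, f0, amplitude = [], [], []
    for pitch in pitches:
        freq = 440.0 * 2 ** ((pitch - 69) / 12)  # MIDI note number to Hz
        for i in range(n):
            a = 0.2 * (1 - i / n)  # linear decay within each note
            audio.append(a * math.sin(2 * math.pi * freq * i / SAMPLE_RATE))
            f0.append(freq)
            amplitude.append(a)
    return {"audio": audio, "f0": f0, "amplitude": amplitude}


def generate_example(seed=0):
    """Run both stages and return the mix together with all annotations."""
    notes = generate_notes(seed=seed)
    stems = [synthesize_part(part) for part in notes]
    mix = [sum(samples) for samples in zip(*(s["audio"] for s in stems))]
    return {"notes": notes, "stems": stems, "mix": mix}
```

Because the audio is rendered from the notes and synthesis parameters rather than annotated after the fact, the labels in this sketch are exact by construction, and changing `seed` yields a new labeled example each time.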

