Solos: A Dataset for Audio-Visual Music Analysis

06/14/2020
by   Juan F. Montesinos, et al.
0

In this paper, we present a new dataset of music performance videos which can be used for training machine learning methods for multiple tasks such as audio-visual blind source separation and localization, cross-modal correspondences, cross-modal generation and, in general, any audio-visual self-supervised task. These videos, gathered from YouTube, consist of solo musical performances of 13 different instruments. Compared to previously proposed audio-visual datasets, Solos is cleaner since a big amount of its recordings are auditions and manually checked recordings, ensuring there is no background noise nor effects added in the video post-processing. Besides, it is, up to the best of our knowledge, the only dataset that contains the whole set of instruments present in the URMP<cit.> dataset, a high-quality dataset of 44 audio-visual recordings of multi-instrument classical music pieces with individual audio tracks. URMP was intented to be used for source separation, thus, we evaluate the performance on the URMP dataset of two different source-separation models trained on Solos. The dataset is publicly available at https://juanfmontesinos.github.io/Solos/

READ FULL TEXT
research
10/27/2020

Remixing Music with Visual Conditioning

We propose a visually conditioned music remixing system by incorporating...
research
12/27/2016

Creating A Multi-track Classical Musical Performance Dataset for Multimodal Music Analysis: Challenges, Insights, and Applications

We introduce a dataset for facilitating audio-visual analysis of musical...
research
07/09/2021

Blind Source Separation in Polyphonic Music Recordings Using Deep Neural Networks Trained via Policy Gradients

We propose a method for the blind separation of sounds of musical instru...
research
06/26/2019

Learning a Joint Embedding Space of Monophonic and Mixed Music Signals for Singing Voice

Previous approaches in singer identification have used one of monophonic...
research
02/03/2021

Music source separation conditioned on 3D point clouds

Recently, significant progress has been made in audio source separation ...
research
09/14/2022

CCOM-HuQin: an Annotated Multimodal Chinese Fiddle Performance Dataset

HuQin is a family of traditional Chinese bowed string instruments. Playi...
research
07/24/2023

Self-refining of Pseudo Labels for Music Source Separation with Noisy Labeled Data

Music source separation (MSS) faces challenges due to the limited availa...

Please sign up or login with your details

Forgot password? Click here to reset