Source Separation-based Data Augmentation for Improved Joint Beat and Downbeat Tracking

by   Ching-Yu Chiu, et al.

Due to advances in deep learning, the performance of automatic beat and downbeat tracking in musical audio signals has seen great improvement in recent years. In training such deep learning based models, data augmentation has been found an important technique. However, existing data augmentation methods for this task mainly target at balancing the distribution of the training data with respect to their tempo. In this paper, we investigate another approach for data augmentation, to account for the composition of the training data in terms of the percussive and non-percussive sound sources. Specifically, we propose to employ a blind drum separation model to segregate the drum and non-drum sounds from each training audio signal, filtering out training signals that are drumless, and then use the obtained drum and non-drum stems to augment the training data. We report experiments on four completely unseen test sets, validating the effectiveness of the proposed method, and accordingly the importance of drum sound composition in the training data for beat and downbeat tracking.



page 3

page 4


Mixing-Specific Data Augmentation Techniques for Improved Blind Violin/Piano Source Separation

Blind music source separation has been a popular and active subject of r...

Structure and Automatic Segmentation of Dhrupad Vocal Bandish Audio

A Dhrupad vocal concert comprises a composition section that is interspe...

Drum-Aware Ensemble Architecture for Improved Joint Musical Beat and Downbeat Tracking

This paper presents a novel system architecture that integrates blind so...

Spatial mixup: Directional loudness modification as data augmentation for sound event localization and detection

Data augmentation methods have shown great importance in diverse supervi...

Improving singing voice separation using Deep U-Net and Wave-U-Net with data augmentation

State-of-the-art singing voice separation is based on deep learning maki...

Data augmentation using generative networks to identify dementia

Data limitation is one of the most common issues in training machine lea...

Singing voice separation: a study on training data

In the recent years, singing voice separation systems showed increased p...

Code Repositories

This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.