Barwise Compression Schemes for Audio-Based Music Structure Analysis

02/10/2022
by   Axel Marmoret, et al.
0

Music Structure Analysis (MSA) consists in segmenting a music piece in several distinct sections. We approach MSA within a compression framework, under the hypothesis that the structure is more easily revealed by a simplified representation of the original content of the song. More specifically, under the hypothesis that MSA is correlated with similarities occurring at the bar scale, linear and non-linear compression schemes can be applied to barwise audio signals. Compressed representations capture the most salient components of the different bars in the song and are then used to infer the song structure using a dynamic programming algorithm. This work explores both low-rank approximation models such as Principal Component Analysis or Nonnegative Matrix Factorization and "piece-specific" Auto-Encoding Neural Networks, with the objective to learn latent representations specific to a given song. Such approaches do not rely on supervision nor annotations, which are well-known to be tedious to collect and possibly ambiguous in MSA description. In our experiments, several unsupervised compression schemes achieve a level of performance comparable to that of state-of-the-art supervised methods (for 3s tolerance) on the RWC-Pop dataset, showcasing the importance of the barwise compression processing for MSA.

READ FULL TEXT

page 4

page 5

research
10/27/2021

Exploring single-song autoencoding schemes for audio-based music structure analysis

The ability of deep neural networks to learn complex data relations and ...
research
10/27/2022

Convolutive Block-Matching Segmentation Algorithm with Application to Music Structure Analysis

Music Structure Analysis (MSA) consists of representing a song in sectio...
research
02/10/2022

Semi-Supervised Convolutive NMF for Automatic Music Transcription

Automatic Music Transcription, which consists in transforming an audio r...
research
03/30/2022

Forensic Analysis and Localization of Multiply Compressed MP3 Audio Using Transformers

Audio signals are often stored and transmitted in compressed formats. Am...
research
04/17/2021

Uncovering audio patterns in music with Nonnegative Tucker Decomposition for structural segmentation

Recent work has proposed the use of tensor decomposition to model repeti...
research
10/04/2017

Improving Compression Based Dissimilarity Measure for Music Score Analysis

In this paper, we propose a way to improve the compression based dissimi...

Please sign up or login with your details

Forgot password? Click here to reset