MAP-Music2Vec: A Simple and Effective Baseline for Self-Supervised Music Audio Representation Learning

12/05/2022
by Yizhi Li, et al.

The deep learning community has witnessed exponentially growing interest in self-supervised learning (SSL). However, how to build a framework for learning useful representations of raw music waveforms in a self-supervised manner remains unexplored. In this work, we design Music2Vec, a framework exploring different SSL algorithmic components and tricks for music audio recordings. Our model achieves results comparable to the state-of-the-art (SOTA) music SSL model Jukebox, despite being significantly smaller, with less than 2% of the latter's parameters. The model will be released on Huggingface (please refer to: https://huggingface.co/m-a-p/music2vec-v1).

research
07/10/2022

Towards Proper Contrastive Self-supervised Learning Strategies For Music Audio Representation

The common research goal of self-supervised learning is to extract a gen...
research
03/17/2021

Contrastive Learning of Musical Representations

While supervised learning has enabled great advances in many areas of mu...
research
06/13/2022

Self-Supervised Representation Learning With MUlti-Segmental Informational Coding (MUSIC)

Self-supervised representation learning maps high-dimensional data into ...
research
09/26/2022

End-to-End Lyrics Recognition with Self-supervised Learning

Lyrics recognition is an important task in music processing. Despite tra...
research
07/07/2022

Self-Supervised Learning of Music-Dance Representation through Explicit-Implicit Rhythm Synchronization

Although audio-visual representation has been proved to be applicable in...
research
09/13/2017

Generating Music Medleys via Playing Music Puzzle Games

Generating music medleys is about finding an optimal permutation of a gi...
research
10/31/2022

Self-Supervised Hierarchical Metrical Structure Modeling

We propose a novel method to model hierarchical metrical structures for ...
