Multi-Task Music Representation Learning from Multi-Label Embeddings

09/17/2019
by   Alexander Schindler, et al.
0

This paper presents a novel approach to music representation learning. Triplet loss based networks have become popular for representation learning in various multimedia retrieval domains. Yet, one of the most crucial parts of this approach is the appropriate selection of triplets, which is indispensable, considering that the number of possible triplets grows cubically. We present an approach to harness multi-tag annotations for triplet selection, by using Latent Semantic Indexing to project the tags onto a high-dimensional space. From this we estimate tag-relatedness to select hard triplets. The approach is evaluated in a multi-task scenario for which we introduce four large multi-tag annotations for the Million Song Dataset for the music properties genres, styles, moods, and themes.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

research
10/30/2020

Multimodal Metric Learning for Tag-based Music Retrieval

Tag-based music retrieval is crucial to browse large-scale music librari...
research
09/16/2020

Multilingual Music Genre Embeddings for Effective Cross-Lingual Music Item Annotation

Annotating music items with music genres is crucial for music recommenda...
research
03/24/2016

Recursive Neural Language Architecture for Tag Prediction

We consider the problem of learning distributed representations for tags...
research
07/18/2019

Leveraging Knowledge Bases And Parallel Annotations For Music Genre Translation

Prevalent efforts have been put in automatically inferring genres of mus...
research
11/26/2022

Toward Universal Text-to-Music Retrieval

This paper introduces effective design choices for text-to-music retriev...
research
11/02/2021

Multi-input Architecture and Disentangled Representation Learning for Multi-dimensional Modeling of Music Similarity

In the context of music information retrieval, similarity-based approach...
research
03/27/2020

Unsupervised Cross-Modal Audio Representation Learning from Unstructured Multilingual Text

We present an approach to unsupervised audio representation learning. Ba...

Please sign up or login with your details

Forgot password? Click here to reset