Zero-shot Learning for Audio-based Music Classification and Tagging

07/05/2019
by   Jeong Choi, et al.
0

Audio-based music classification and tagging is typically based on categorical supervised learning with a fixed set of labels. This intrinsically cannot handle unseen labels such as newly added music genres or semantic words that users arbitrarily choose for music retrieval. Zero-shot learning can address this problem by leveraging an additional semantic space of labels where side information about the labels is used to unveil the relationship between each other. In this work, we investigate the zero-shot learning in the music domain and organize two different setups of side information. One is using human-labeled attribute information based on Free Music Archive and OpenMIC-2018 datasets. The other is using general word semantic information based on Million Song Dataset and Last.fm tag annotations. Considering a music track is usually multi-labeled in music classification and tagging datasets, we also propose a data split scheme and associated evaluation settings for the multi-label zero-shot learning. Finally, we report experimental results and discuss the effectiveness and new possibilities of zero-shot learning in the music domain.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/20/2019

Zero-shot Learning and Knowledge Transfer in Music Classification and Tagging

Music classification and tagging is conducted through categorical superv...
research
10/27/2019

Transferring neural speech waveform synthesizers to musical instrument sounds generation

Recent neural waveform synthesizers such as WaveNet, WaveGlow, and the n...
research
08/24/2022

Improved Zero-Shot Audio Tagging Classification with Patchout Spectrogram Transformers

Standard machine learning models for tagging and classifying acoustic si...
research
08/26/2022

MuLan: A Joint Embedding of Music Audio and Natural Language

Music tagging and content-based retrieval systems have traditionally bee...
research
01/09/2023

Leveraging Contextual Relatedness to Identify Suicide Documentation in Clinical Notes through Zero Shot Learning

Identifying suicidality including suicidal ideation, attempts, and risk ...
research
06/10/2022

Zero-Shot Audio Classification using Image Embeddings

Supervised learning methods can solve the given problem in the presence ...
research
02/27/2015

Probabilistic Zero-shot Classification with Semantic Rankings

In this paper we propose a non-metric ranking-based representation of se...

Please sign up or login with your details

Forgot password? Click here to reset