Less is more: Faster and better music version identification with embedding distillation

10/07/2020
by Furkan Yesiler, et al.

Version identification systems aim to detect different renditions of the same underlying musical composition (loosely called cover songs). By learning to encode entire recordings into plain vector embeddings, recent systems have made significant progress in bridging the gap between accuracy and scalability, which has been a key challenge for nearly two decades. In this work, we propose to further narrow this gap by employing a set of data distillation techniques that reduce the embedding dimensionality of a pre-trained state-of-the-art model. We compare a wide range of techniques, from classical dimensionality reduction to more sophisticated distillation schemes, and propose new ones. With those, we obtain 99% smaller embeddings that, moreover, yield up to a 3% accuracy increase. Such small embeddings can have an important impact on retrieval time, up to the point of making a real-world system practical on a standalone laptop.
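To make the idea of shrinking embeddings before retrieval concrete, here is a minimal sketch in plain Python. It is not the method of the paper: it uses a Gaussian random projection as a simple stand-in for dimensionality reduction, and every name, dimension (512 to 32), and noise level below is a hypothetical choice on synthetic data, made only for illustration.

```python
import math
import random

def random_projection_matrix(in_dim, out_dim, seed=0):
    """Gaussian random matrix, scaled so vector norms are preserved on average."""
    rng = random.Random(seed)
    scale = 1.0 / math.sqrt(out_dim)
    return [[rng.gauss(0.0, 1.0) * scale for _ in range(in_dim)]
            for _ in range(out_dim)]

def project(matrix, vec):
    """Map a vector to the lower-dimensional space (matrix-vector product)."""
    return [sum(w * x for w, x in zip(row, vec)) for row in matrix]

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def nearest(query, catalogue):
    """Index of the catalogue embedding most similar to the query."""
    return max(range(len(catalogue)), key=lambda i: cosine(query, catalogue[i]))

# Synthetic stand-in for recording embeddings (assumption: versions of the
# same work are noisy copies of one underlying vector).
rng = random.Random(42)
IN_DIM, OUT_DIM, N_WORKS = 512, 32, 20

def noisy_version(base, noise=0.2):
    return [x + rng.gauss(0.0, noise) for x in base]

works = [[rng.gauss(0.0, 1.0) for _ in range(IN_DIM)] for _ in range(N_WORKS)]
catalogue = [noisy_version(w) for w in works]   # one reference version per work
query = noisy_version(works[7])                 # another version of work 7

# Reduce every embedding once, then search in the small space.
P = random_projection_matrix(IN_DIM, OUT_DIM)
small_catalogue = [project(P, v) for v in catalogue]
small_query = project(P, query)

full_match = nearest(query, catalogue)
small_match = nearest(small_query, small_catalogue)
```

In this toy setting each similarity computation touches 32 numbers instead of 512, so the per-comparison cost and the storage per recording both shrink in proportion to the dimensionality, which is the kind of retrieval-time effect the abstract refers to.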


