ByteCover: Cover Song Identification via Multi-Loss Training

10/27/2020
by   Xingjian Du, et al.

We present ByteCover, a new feature learning method for cover song identification (CSI). ByteCover builds on the classical ResNet model and adds two improvements that strengthen the model for CSI. First, we integrate instance normalization (IN) and batch normalization (BN) into IBN blocks, which are the main components of our ResNet-IBN model. With IBN blocks, the CSI model learns features that are invariant to changes in musical attributes such as key, tempo, timbre, and genre, while preserving version information. Second, we employ the BNNeck method to enable multi-loss training, so that the model jointly optimizes a classification loss and a triplet loss; this ensures inter-class discrimination and intra-class compactness of cover songs at the same time. Experiments on multiple datasets demonstrate the effectiveness and efficiency of ByteCover; on the Da-TACOS dataset, ByteCover outperformed the best competing system by 20.9%.
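The multi-loss idea can be illustrated with a small pure-Python sketch. It is not the authors' implementation: it assumes the usual BNNeck arrangement, in which the triplet loss is computed on the raw embedding and the classification loss on a batch-normalized copy of it. The margin of 0.3, the equal weighting of the two losses, the toy embeddings, and every function name here are assumptions made for illustration only.

```python
import math


def batch_norm(features, eps=1e-5):
    """Normalize each dimension across the batch (no learned scale or shift)."""
    n, dims = len(features), len(features[0])
    means = [sum(f[d] for f in features) / n for d in range(dims)]
    variances = [sum((f[d] - means[d]) ** 2 for f in features) / n
                 for d in range(dims)]
    return [[(f[d] - means[d]) / math.sqrt(variances[d] + eps)
             for d in range(dims)] for f in features]


def euclidean(a, b):
    """Euclidean distance between two embeddings."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(anchor, positive, negative, margin=0.3):
    """Hinge loss pulling same-song versions together and pushing others apart."""
    return max(0.0, euclidean(anchor, positive)
               - euclidean(anchor, negative) + margin)


def cross_entropy(logits, label):
    """Softmax cross-entropy for one sample, computed stably."""
    peak = max(logits)
    log_sum = peak + math.log(sum(math.exp(z - peak) for z in logits))
    return log_sum - logits[label]


def linear(feature, weights):
    """Bias-free linear classifier: one weight row per class."""
    return [sum(w * x for w, x in zip(row, feature)) for row in weights]


def joint_loss(features, labels, triplets, weights, margin=0.3):
    """Return (total, classification, triplet) losses averaged over the batch."""
    # Triplet loss on the raw embeddings (assumed BNNeck arrangement).
    tri = sum(triplet_loss(features[a], features[p], features[n], margin)
              for a, p, n in triplets) / len(triplets)
    # Classification loss on the batch-normalized embeddings.
    normed = batch_norm(features)
    ce = sum(cross_entropy(linear(f, weights), y)
             for f, y in zip(normed, labels)) / len(labels)
    return ce + tri, ce, tri


# Toy batch: two versions each of two hypothetical songs (labels 0 and 1).
features = [[0.0, 0.0], [0.0, 1.0], [3.0, 4.0], [3.0, 5.0]]
labels = [0, 0, 1, 1]
triplets = [(0, 1, 2), (2, 3, 0)]       # (anchor, positive, negative) indices
weights = [[-1.0, 0.0], [1.0, 0.0]]     # hypothetical classifier weights

total, ce, tri = joint_loss(features, labels, triplets, weights)
print(total, ce, tri)
```

In this toy batch the two songs are already well separated, so the triplet term is zero and only the classification term contributes to the total.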
