CoverHunter: Cover Song Identification with Refined Attention and Alignments

06/15/2023
by   Feng Liu, et al.
0

Abstract: Cover song identification (CSI) focuses on finding the same music with different versions in reference anchors given a query track. In this paper, we propose a novel system named CoverHunter that overcomes the shortcomings of existing detection schemes by exploring richer features with refined attention and alignments. CoverHunter contains three key modules: 1) A convolution-augmented transformer (i.e., Conformer) structure that captures both local and global feature interactions in contrast to previous methods mainly relying on convolutional neural networks; 2) An attention-based time pooling module that further exploits the attention in the time dimension; 3) A novel coarse-to-fine training scheme that first trains a network to roughly align the song chunks and then refines the network by training on the aligned chunks. At the same time, we also summarize some important training tricks used in our system that help achieve better results. Experiments on several standard CSI datasets show that our method significantly improves over state-of-the-art methods with an embedding size of 128 (2.3 DaTacos).

READ FULL TEXT

page 2

page 4

research
03/21/2023

ByteCover3: Accurate Cover Song Identification on Short Queries

Deep learning based methods have become a paradigm for cover song identi...
research
11/01/2019

Learning a Representation for Cover Song Identification Using Convolutional Neural Network

Cover song identification represents a challenging task in the field of ...
research
12/06/2022

AbHE: All Attention-based Homography Estimation

Homography estimation is a basic computer vision task, which aims to obt...
research
10/25/2022

End-to-end Transformer for Compressed Video Quality Enhancement

Convolutional neural networks have achieved excellent results in compres...
research
09/13/2022

DMTNet: Dynamic Multi-scale Network for Dual-pixel Images Defocus Deblurring with Transformer

Recent works achieve excellent results in defocus deblurring task based ...
research
10/27/2020

ByteCover: Cover Song Identification via Multi-Loss Training

We present in this paper ByteCover, which is a new feature learning meth...
research
07/19/2023

DisCover: Disentangled Music Representation Learning for Cover Song Identification

In the field of music information retrieval (MIR), cover song identifica...

Please sign up or login with your details

Forgot password? Click here to reset