Towards Robust and Truly Large-Scale Audio-Sheet Music Retrieval

09/21/2023
by   Luis Carvalho, et al.
0

A range of applications of multi-modal music information retrieval is centred around the problem of connecting large collections of sheet music (images) to corresponding audio recordings, that is, identifying pairs of audio and score excerpts that refer to the same musical content. One of the typical and most recent approaches to this task employs cross-modal deep learning architectures to learn joint embedding spaces that link the two distinct modalities - audio and sheet music images. While there has been steady improvement on this front over the past years, a number of open problems still prevent large-scale employment of this methodology. In this article we attempt to provide an insightful examination of the current developments on audio-sheet music retrieval via deep learning methods. We first identify a set of main challenges on the road towards robust and large-scale cross-modal music retrieval in real scenarios. We then highlight the steps we have taken so far to address some of these challenges, documenting step-by-step improvement along several dimensions. We conclude by analysing the remaining challenges and present ideas for solving these, in order to pave the way to a unified and robust methodology for cross-modal music retrieval.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/21/2023

Passage Summarization with Recurrent Models for Audio-Sheet Music Retrieval

Many applications of cross-modal music retrieval are related to connecti...
research
03/14/2023

Improving Music Genre Classification from multi-modal properties of music and genre correlations Perspective

Music genre classification has been widely studied in past few years for...
research
02/12/2019

Cross-Modal Music Retrieval and Applications: An Overview of Key Methodologies

There has been a rapid growth of digitally available music data, includi...
research
05/26/2021

Exploiting Temporal Dependencies for Cross-Modal Music Piece Identification

This paper addresses the problem of cross-modal musical piece identifica...
research
12/11/2019

deepsing: Generating Sentiment-aware Visual Stories using Cross-modal Music Translation

In this paper we propose a deep learning method for performing attribute...
research
06/26/2019

Learning Soft-Attention Models for Tempo-invariant Audio-Sheet Music Retrieval

Connecting large libraries of digitized audio recordings to their corres...
research
12/21/2022

RECAP: Retrieval Augmented Music Captioner

With the prevalence of stream media platforms serving music search and r...

Please sign up or login with your details

Forgot password? Click here to reset