Music-to-Text Synaesthesia: Generating Descriptive Text from Music Recordings

10/02/2022
by   Zhihuan Kuang, et al.
0

In this paper, we consider a novel research problem, music-to-text synaesthesia. Different from the classical music tagging problem that classifies a music recording into pre-defined categories, the music-to-text synaesthesia aims to generate descriptive texts from music recordings for further understanding. Although this is a new and interesting application to the machine learning community, to our best knowledge, the existing music-related datasets do not contain the semantic descriptions on music recordings and cannot serve the music-to-text synaesthesia task. In light of this, we collect a new dataset that contains 1,955 aligned pairs of classical music recordings and text descriptions. Based on this, we build a computational model to generate sentences that can describe the content of the music recording. To tackle the highly non-discriminative classical music, we design a group topology-preservation loss in our computational model, which considers more samples as a group reference and preserves the relative topology among different samples. Extensive experimental results qualitatively and quantitatively demonstrate the effectiveness of our proposed model over five heuristics or pre-trained competitive methods and their variants on our collected dataset.

READ FULL TEXT
research
09/05/2022

Bridging Music and Text with Crowdsourced Music Comments: A Sequence-to-Sequence Framework for Thematic Music Comments Generation

We consider a novel task of automatically generating text descriptions o...
research
06/15/2023

Language-Guided Music Recommendation for Video via Prompt Analogies

We propose a method to recommend music for an input video while allowing...
research
06/29/2018

Exploratory Analysis of a Large Flamenco Corpus using an Ensemble of Convolutional Neural Networks as a Structural Annotation Backend

We present computational tools that we developed for the analysis of a l...
research
05/14/2022

Generating Tips from Song Reviews: A New Dataset and Framework

Reviews of songs play an important role in online music service platform...
research
02/20/2022

towards automatic transcription of polyphonic electric guitar music:a new dataset and a multi-loss transformer model

In this paper, we propose a new dataset named EGDB, that con-tains trans...
research
05/02/2022

Music Interpretation Analysis. A Multimodal Approach To Score-Informed Resynthesis of Piano Recordings

This Thesis discusses the development of technologies for the automatic ...
research
02/18/2022

Deep-Learning Architectures for Multi-Pitch Estimation: Towards Reliable Evaluation

Extracting pitch information from music recordings is a challenging but ...

Please sign up or login with your details

Forgot password? Click here to reset