Vis2Mus: Exploring Multimodal Representation Mapping for Controllable Music Generation

11/10/2022
by Runbang Zhang, et al.

In this study, we explore representation mapping from the domain of visual arts to the domain of music, so that visual arts can serve as an effective handle for controlling music generation. Unlike most studies in multimodal representation learning, which are purely data-driven, we adopt an analysis-by-synthesis approach that combines deep music representation learning with user studies. This approach lets us discover interpretable representation mappings without a large amount of paired data. In particular, we find that the visual-to-music mapping has a property similar to equivariance: image transformations such as changing brightness, changing contrast, or applying style transfer can be used to control corresponding transformations in the music domain. In addition, we released the Vis2Mus system as a controllable interface for symbolic music generation.
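
The equivariance-like property mentioned in the abstract can be written informally as follows. This is only a sketch of the standard definition of (approximate) equivariance; the symbols f, T_v, and T_m are illustrative assumptions and are not taken from the paper's own notation.

```latex
% Assumed notation (hypothetical, for illustration only):
%   x    : a representation of a visual artwork
%   f    : the visual-to-music representation mapping
%   T_v  : an image transformation (e.g., a brightness or contrast change)
%   T_m  : the corresponding transformation in the music domain
f\bigl(T_v(x)\bigr) \approx T_m\bigl(f(x)\bigr)
```

Read this way, transforming the image and then mapping it to music gives roughly the same result as mapping the image to music first and then applying the matching music transformation, which is what would make image edits usable as a control handle.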

research
03/19/2018

Music Style Transfer: A Position Paper

Led by the success of neural style transfer on visual arts, there has be...
research
03/19/2018

Music Style Transfer Issues: A Position Paper

Led by the success of neural style transfer on visual arts, there has be...
research
08/17/2020

Learning Interpretable Representation for Controllable Polyphonic Music Generation

While deep generative models have become the leading methods for algorit...
research
06/02/2023

Q&A: Query-Based Representation Learning for Multi-Track Symbolic Music re-Arrangement

Music rearrangement is a common music practice of reconstructing and rec...
research
09/15/2022

Domain Adversarial Training on Conditional Variational Auto-Encoder for Controllable Music Generation

The variational auto-encoder has become a leading framework for symbolic...
research
08/18/2022

Musika! Fast Infinite Waveform Music Generation

Fast and user-controllable music generation could enable novel ways of c...
research
07/29/2020

dMelodies: A Music Dataset for Disentanglement Learning

Representation learning focused on disentangling the underlying factors ...
