Upmixing via style transfer: a variational autoencoder for disentangling spatial images and musical content

03/22/2022
by   Haici Yang, et al.
0

In the stereo-to-multichannel upmixing problem for music, one of the main tasks is to set the directionality of the instrument sources in the multichannel rendering results. In this paper, we propose a modified variational autoencoder model that learns a latent space to describe the spatial images in multichannel music. We seek to disentangle the spatial images and music content, so the learned latent variables are invariant to the music. At test time, we use the latent variables to control the panning of sources. We propose two upmixing use cases: transferring the spatial images from one song to another and blind panning based on the generative model. We report objective and subjective evaluation results to empirically show that our model captures spatial images separately from music content and achieves transfer-based interactive panning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/13/2021

Multiple Style Transfer via Variational AutoEncoder

Modern works on style transfer focus on transferring style from a single...
research
09/03/2019

Translating Visual Art into Music

The Synesthetic Variational Autoencoder (SynVAE) introduced in this rese...
research
09/25/2019

Explicitly disentangling image content from translation and rotation with spatial-VAE

Given an image dataset, we are often interested in finding data generati...
research
01/15/2020

Learning Style-Aware Symbolic Music Representations by Adversarial Autoencoders

We address the challenging open problem of learning an effective latent ...
research
09/06/2023

Self-Supervised Disentanglement of Harmonic and Rhythmic Features in Music Audio Signals

The aim of latent variable disentanglement is to infer the multiple info...
research
05/31/2021

Factorising Meaning and Form for Intent-Preserving Paraphrasing

We propose a method for generating paraphrases of English questions that...
research
08/01/2021

SurpriseNet: Melody Harmonization Conditioning on User-controlled Surprise Contours

The surprisingness of a song is an essential and seemingly subjective fa...

Please sign up or login with your details

Forgot password? Click here to reset