Actions Speak Louder than Listening: Evaluating Music Style Transfer based on Editing Experience

10/25/2021
by   Wei-Tsung Lu, et al.
0

The subjective evaluation of music generation techniques has been mostly done with questionnaire-based listening tests while ignoring the perspectives from music composition, arrangement, and soundtrack editing. In this paper, we propose an editing test to evaluate users' editing experience of music generation models in a systematic way. To do this, we design a new music style transfer model combining the non-chronological inference architecture, autoregressive models and the Transformer, which serves as an improvement from the baseline model on the same style transfer task. Then, we compare the performance of the two models with a conventional listening test and the proposed editing test, in which the quality of generated samples is assessed by the amount of effort (e.g., the number of required keyboard and mouse actions) spent by users to polish a music clip. Results on two target styles indicate that the improvement over the baseline model can be reflected by the editing test quantitatively. Also, the editing test provides profound insights which are not accessible from usual listening tests. The major contribution of this paper is the systematic presentation of the editing test and the corresponding insights, while the proposed music style transfer model based on state-of-the-art neural networks represents another contribution.

READ FULL TEXT
research
01/27/2023

Prompt-Based Editing for Text Style Transfer

Prompting approaches have been recently explored in text style transfer,...
research
09/20/2018

MIDI-VAE: Modeling Dynamics and Instrumentation of Music with Applications to Style Transfer

We introduce MIDI-VAE, a neural network model based on Variational Autoe...
research
01/21/2020

Neural Style Difference Transfer and Its Application to Font Generation

Designing fonts requires a great deal of time and effort. It requires pr...
research
03/19/2018

Music Style Transfer: A Position Paper

Led by the success of neural style transfer on visual arts, there has be...
research
10/19/2019

XL-Editor: Post-editing Sentences with XLNet

While neural sequence generation models achieve initial success for many...
research
11/01/2022

SDMuse: Stochastic Differential Music Editing and Generation via Hybrid Representation

While deep generative models have empowered music generation, it remains...
research
03/19/2018

Music Style Transfer Issues: A Position Paper

Led by the success of neural style transfer on visual arts, there has be...

Please sign up or login with your details

Forgot password? Click here to reset