Melody Generation for Pop Music via Word Representation of Musical Properties

10/31/2017
by   Andrew Shin, et al.
0

Automatic melody generation for pop music has been a long-time aspiration for both AI researchers and musicians. However, learning to generate euphonious melody has turned out to be highly challenging due to a number of factors. Representation of multivariate property of notes has been one of the primary challenges. It is also difficult to remain in the permissible spectrum of musical variety, outside of which would be perceived as a plain random play without auditory pleasantness. Observing the conventional structure of pop music poses further challenges. In this paper, we propose to represent each note and its properties as a unique `word,' thus lessening the prospect of misalignments between the properties, as well as reducing the complexity of learning. We also enforce regularization policies on the range of notes, thus encouraging the generated melody to stay close to what humans would find easy to follow. Furthermore, we generate melody conditioned on song part information, thus replicating the overall structure of a full song. Experimental results demonstrate that our model can generate auditorily pleasant songs that are more indistinguishable from human-written ones than previous models.

READ FULL TEXT
research
09/14/2021

Structure-Enhanced Pop Music Generation via Harmony-Aware Learning

Automatically composing pop music with a satisfactory structure is an at...
research
12/03/2016

DeepBach: a Steerable Model for Bach Chorales Generation

This paper introduces DeepBach, a graphical model aimed at modeling poly...
research
09/02/2021

Controllable deep melody generation via hierarchical music structure representation

Recent advances in deep learning have expanded possibilities to generate...
research
07/23/2021

Multi-Channel Automatic Music Transcription Using Tensor Algebra

Music is an art, perceived in unique ways by every listener, coming from...
research
08/26/2022

Mel Spectrogram Inversion with Stable Pitch

Vocoders are models capable of transforming a low-dimensional spectral r...
research
10/12/2016

Maximum entropy models for generation of expressive music

In the context of contemporary monophonic music, expression can be seen ...
research
11/25/2021

A-Muze-Net: Music Generation by Composing the Harmony based on the Generated Melody

We present a method for the generation of Midi files of piano music. The...

Please sign up or login with your details

Forgot password? Click here to reset