TräumerAI: Dreaming Music with StyleGAN

02/09/2021
by   Dasaem Jeong, et al.
23

The goal of this paper to generate a visually appealing video that responds to music with a neural network so that each frame of the video reflects the musical characteristics of the corresponding audio clip. To achieve the goal, we propose a neural music visualizer directly mapping deep music embeddings to style embeddings of StyleGAN, named TräumerAI, which consists of a music auto-tagging model using short-chunk CNN and StyleGAN2 pre-trained on WikiArt dataset. Rather than establishing an objective metric between musical and visual semantics, we manually labeled the pairs in a subjective manner. An annotator listened to 100 music clips of 10 seconds long and selected an image that suits the music among the 200 StyleGAN-generated examples. Based on the collected data, we trained a simple transfer function that converts an audio embedding to a style embedding. The generated examples show that the mapping between audio and video makes a certain level of intra-segment similarity and inter-segment dissimilarity.

READ FULL TEXT

page 4

page 5

research
04/13/2021

Comparison and Analysis of Deep Audio Embeddings for Music Emotion Recognition

Emotion is a complicated notion present in music that is hard to capture...
research
05/11/2023

V2Meow: Meowing to the Visual Beat via Music Generation

Generating high quality music that complements the visual content of a v...
research
06/30/2023

Audio Embeddings as Teachers for Music Classification

Music classification has been one of the most popular tasks in the field...
research
02/17/2022

End-to-end Music Remastering System Using Self-supervised and Adversarial Training

Mastering is an essential step in music production, but it is also a cha...
research
06/21/2019

Understanding and Classifying Cultural Music Using Melodic Features Case Of Hindustani, Carnatic And Turkish Music

We present a melody based classification of musical styles by exploiting...
research
02/04/2022

Musical Audio Similarity with Self-supervised Convolutional Neural Networks

We have built a music similarity search engine that lets video producers...
research
09/04/2023

MDSC: Towards Evaluating the Style Consistency Between Music and

We propose MDSC(Music-Dance-Style Consistency), the first evaluation met...

Please sign up or login with your details

Forgot password? Click here to reset