Foley Music: Learning to Generate Music from Videos

07/21/2020
by   Chuang Gan, et al.
0

In this paper, we introduce Foley Music, a system that can synthesize plausible music for a silent video clip about people playing musical instruments. We first identify two key intermediate representations for a successful video to music generator: body keypoints from videos and MIDI events from audio recordings. We then formulate music generation from videos as a motion-to-MIDI translation problem. We present a Graph-Transformer framework that can accurately predict MIDI event sequences in accordance with the body movements. The MIDI event can then be converted to realistic music using an off-the-shelf music synthesizer tool. We demonstrate the effectiveness of our models on videos containing a variety of music performances. Experimental results show that our model outperforms several existing systems in generating music that is pleasant to listen to. More importantly, the MIDI representations are fully interpretable and transparent, thus enabling us to perform music editing flexibly. We encourage the readers to watch the demo video with audio turned on to experience the results.

READ FULL TEXT

page 2

page 12

research
01/22/2023

Dance2MIDI: Dance-driven multi-instruments music generation

Dance-driven music generation aims to generate musical pieces conditione...
research
12/07/2020

Multi-Instrumentalist Net: Unsupervised Generation of Music from Body Movements

We propose a novel system that takes as an input body movements of a mus...
research
12/19/2017

Audio to Body Dynamics

We present a method that gets as input an audio of violin or piano playi...
research
04/01/2022

Quantized GAN for Complex Music Generation from Dance Videos

We present Dance2Music-GAN (D2M-GAN), a novel adversarial multi-modal fr...
research
08/24/2023

Exploiting Time-Frequency Conformers for Music Audio Enhancement

With the proliferation of video platforms on the internet, recording mus...
research
11/01/2015

Using Raspberry Pi for scientific video observation of pedestrians during a music festival

The document serves as a reference for researchers trying to capture a l...
research
06/23/2020

Audeo: Audio Generation for a Silent Performance Video

We present a novel system that gets as an input video frames of a musici...

Please sign up or login with your details

Forgot password? Click here to reset