Modeling Animal Vocalizations through Synthesizers

10/19/2022
by   Masato Hagiwara, et al.
0

Modeling real-world sound is a fundamental problem in the creative use of machine learning and many other fields, including human speech processing and bioacoustics. Transformer-based generative models and some prior work (e.g., DDSP) are known to produce realistic sound, although they have limited control and are hard to interpret. As an alternative, we aim to use modular synthesizers, i.e., compositional, parametric electronic musical instruments, for modeling non-music sounds. However, inferring synthesizer parameters given a target sound, i.e., the parameter inference task, is not trivial for general sounds, and past research has typically focused on musical sound. In this work, we optimize a differentiable synthesizer from TorchSynth in order to model, emulate, and creatively generate animal vocalizations. We compare an array of optimization methods, from gradient-based search to genetic algorithms, for inferring its parameters, and then demonstrate how one can control and interpret the parameters for modeling non-music sounds.

READ FULL TEXT
research
08/02/2021

Musical Speech: A Transformer-based Composition Tool

In this paper, we propose a new compositional tool that will generate a ...
research
11/04/2021

MT3: Multi-Task Multitrack Music Transcription

Automatic Music Transcription (AMT), inferring musical notes from raw au...
research
05/06/2022

Sound2Synth: Interpreting Sound via FM Synthesizer Parameters Estimation

Synthesizer is a type of electronic musical instrument that is now widel...
research
09/21/2022

Modeling Perceptual Loudness of Piano Tone: Theory and Applications

The relationship between perceptual loudness and physical attributes of ...
research
03/02/2023

AI as mediator between composers, sound designers, and creative media producers

Musical professionals who produce material for non-musical stakeholders ...
research
10/04/2020

Body, Clothes, Water, and Toys: Media Towards Natural Music Expressions with Digital Sounds

In this paper, we introduce our research challenges for creating new mus...
research
04/10/2022

Deep Conditional Representation Learning for Drum Sample Retrieval by Vocalisation

Imitating musical instruments with the human voice is an efficient way o...

Please sign up or login with your details

Forgot password? Click here to reset