The Mirrornet : Learning Audio Synthesizer Controls Inspired by Sensorimotor Interaction

10/12/2021
by   Yashish M. Siriwardena, et al.
0

Experiments to understand the sensorimotor neural interactions in the human cortical speech system support the existence of a bidirectional flow of interactions between the auditory and motor regions. Their key function is to enable the brain to 'learn' how to control the vocal tract for speech production. This idea is the impetus for the recently proposed "MirrorNet", a constrained autoencoder architecture. In this paper, the MirrorNet is applied to learn, in an unsupervised manner, the controls of a specific audio synthesizer (DIVA) to produce melodies only from their auditory spectrograms. The results demonstrate how the MirrorNet discovers the synthesizer parameters to generate the melodies that closely resemble the original and those of unseen melodies, and even determine the best set parameters to approximate renditions of complex piano melodies generated by a different synthesizer. This generalizability of the MirrorNet illustrates its potential to discover from sensory data the controls of arbitrary motor-plants such as autonomous vehicles.

READ FULL TEXT

page 3

page 4

research
10/29/2022

Learning to Compute the Articulatory Representations of Speech with the MIRRORNET

Most organisms including humans function by coordinating and integrating...
research
10/27/2022

Articulation GAN: Unsupervised modeling of articulatory learning

Generative deep neural networks are widely used for speech synthesis, bu...
research
04/19/2021

Bidirectional Interaction between Visual and Motor Generative Models using Predictive Coding and Active Inference

In this work, we build upon the Active Inference (AIF) and Predictive Co...
research
02/22/2005

The Self-Organization of Speech Sounds

The speech code is a vehicle of language: it defines a set of forms used...
research
12/21/2022

An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits

Audio-visual approaches involving visual inputs have laid the foundation...
research
05/01/2019

A Self-Organizing Network with Varying Density Structure for Characterizing Sensorimotor Transformations in Robotic Systems

In this work, we present the development of a neuro-inspired approach fo...
research
02/07/2023

Network-based Statistics Distinguish Anomic and Broca Aphasia

Aphasia is a speech-language impairment commonly caused by damage to the...

Please sign up or login with your details

Forgot password? Click here to reset