MicAugment: One-shot Microphone Style Transfer

10/19/2020
by   Zalán Borsos, et al.
0

A crucial aspect for the successful deployment of audio-based models "in-the-wild" is the robustness to the transformations introduced by heterogeneous acquisition conditions. In this work, we propose a method to perform one-shot microphone style transfer. Given only a few seconds of audio recorded by a target device, MicAugment identifies the transformations associated to the input acquisition pipeline and uses the learned transformations to synthesize audio as if it were recorded under the same conditions as the target audio. We show that our method can successfully apply the style transfer to real audio and that it significantly increases model robustness when used as data augmentation in the downstream tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/29/2017

Time Domain Neural Audio Style Transfer

A recently published method for audio style transfer has shown how to ex...
research
12/18/2018

Autoencoder Based Architecture For Fast & Real Time Audio Style Transfer

Recently, there has been great interest in the field of audio style tran...
research
10/31/2017

Audio style transfer

"Style transfer" among images has recently emerged as a very active rese...
research
11/06/2022

Going In Style: Audio Backdoors Through Stylistic Transformations

A backdoor attack places triggers in victims' deep learning models to en...
research
06/17/2018

Cover Song Synthesis by Analogy

In this work, we pose and address the following "cover song analogies" p...
research
01/04/2018

Neural Style Transfer for Audio Spectograms

There has been fascinating work on creating artistic transformations of ...
research
06/13/2023

Robustness of SAM: Segment Anything Under Corruptions and Beyond

Segment anything model (SAM), as the name suggests, is claimed to be cap...

Please sign up or login with your details

Forgot password? Click here to reset