Self-Supervised Visual Acoustic Matching

07/27/2023
by   Arjun Somayazulu, et al.
0

Acoustic matching aims to re-synthesize an audio clip to sound as if it were recorded in a target acoustic environment. Existing methods assume access to paired training data, where the audio is observed in both source and target environments, but this limits the diversity of training data or requires the use of simulated data or heuristics to create paired samples. We propose a self-supervised approach to visual acoustic matching where training samples include only the target scene image and audio – without acoustically mismatched source audio for reference. Our approach jointly learns to disentangle room acoustics and re-synthesize audio into the target environment, via a conditional GAN framework and a novel metric that quantifies the level of residual acoustic information in the de-biased audio. Training with either in-the-wild web data or simulated data, we demonstrate it outperforms the state-of-the-art on multiple challenging datasets and a wide variety of real-world audio and environments.

READ FULL TEXT

page 7

page 8

page 9

research
02/14/2022

Visual Acoustic Matching

We introduce the visual acoustic matching task, in which an audio clip i...
research
07/28/2020

Self-supervised Neural Audio-Visual Sound Source Localization via Probabilistic Spatial Modeling

Detecting sound source objects within visual observation is important fo...
research
02/04/2023

AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis

Human perception of the complex world relies on a comprehensive analysis...
research
06/08/2022

Few-Shot Audio-Visual Learning of Environment Acoustics

Room impulse response (RIR) functions capture how the surrounding physic...
research
01/20/2023

Novel-View Acoustic Synthesis

We introduce the novel-view acoustic synthesis (NVAS) task: given the si...
research
10/09/2021

Visually Exploring Multi-Purpose Audio Data

We analyse multi-purpose audio using tools to visualise similarities wit...
research
11/02/2022

Adversarial Guitar Amplifier Modelling With Unpaired Data

We propose an audio effects processing framework that learns to emulate ...

Please sign up or login with your details

Forgot password? Click here to reset