Points2Sound: From mono to binaural audio using 3D point cloud scenes

04/26/2021
by Francesc Lluís, et al.

Binaural sound that matches its visual counterpart is crucial for meaningful, immersive experiences in augmented reality (AR) and virtual reality (VR) applications. Recent work has shown that binaural audio can be generated from mono audio using 2D visual information as guidance. Using 3D visual information may allow for a more accurate representation of a virtual audio scene for VR/AR applications. This paper proposes Points2Sound, a multi-modal deep learning model that generates a binaural version of mono audio using 3D point cloud scenes. Specifically, Points2Sound consists of a vision network that extracts visual features from the point cloud scene to condition an audio network, which operates in the waveform domain, to synthesize the binaural version. Both quantitative and perceptual evaluations indicate that the proposed model is preferred over a reference case based on a recent 2D mono-to-binaural model.
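The conditioning idea in the abstract (a vision network summarizes the point cloud, and that summary steers a waveform-domain audio network that outputs two channels) can be illustrated with a toy NumPy sketch. This is not the authors' architecture: the max-pooled per-point features, the scale-and-shift conditioning, and every name below (`vision_features`, `audio_network`, `init_params`, the parameter keys) are hypothetical choices made only for illustration, with random untrained weights.

```python
import numpy as np


def vision_features(points, w_point):
    """Map a point cloud (N, 3) to one global feature vector (D,).

    Assumption: a per-point linear map + ReLU followed by max-pooling,
    which makes the result independent of point ordering.
    """
    return np.maximum(points @ w_point, 0.0).max(axis=0)


def audio_network(mono, feat, params):
    """Map a mono waveform (T,) to a two-channel waveform (2, T).

    Assumption: the visual feature conditions the audio path through a
    per-channel scale and shift of the hidden activations.
    """
    # Lift the waveform to C hidden channels with short 1-D filters.
    hidden = np.stack(
        [np.convolve(mono, k, mode="same") for k in params["kernels"]]
    )  # (C, T)
    scale = feat @ params["w_scale"]  # (C,)
    shift = feat @ params["w_shift"]  # (C,)
    hidden = np.tanh((1.0 + scale)[:, None] * hidden + shift[:, None])
    # Project the hidden channels to left/right outputs.
    return params["w_out"] @ hidden  # (2, T)


def points2sound_sketch(mono, points, params):
    """Vision features condition the waveform-domain audio path."""
    feat = vision_features(points, params["w_point"])
    return audio_network(mono, feat, params)


def init_params(rng, d=8, c=4, k=5):
    """Random, untrained weights (hypothetical sizes d, c, k)."""
    return {
        "w_point": rng.standard_normal((3, d)) * 0.5,
        "kernels": rng.standard_normal((c, k)) * 0.3,
        "w_scale": rng.standard_normal((d, c)) * 0.1,
        "w_shift": rng.standard_normal((d, c)) * 0.1,
        "w_out": rng.standard_normal((2, c)) * 0.5,
    }


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    params = init_params(rng)
    mono = rng.standard_normal(256)          # stand-in mono signal
    points = rng.standard_normal((100, 3))   # stand-in xyz point cloud
    print(points2sound_sketch(mono, points, params).shape)
```

With trained weights, the two output rows would play the role of the left and right ear signals; here they only demonstrate the data flow and tensor shapes.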


