DiVA-360: The Dynamic Visuo-Audio Dataset for Immersive Neural Fields

07/31/2023
by   Cheng-You Lu, et al.
0

Advances in neural fields are enabling high-fidelity capture of the shape and appearance of static and dynamic scenes. However, their capabilities lag behind those offered by representations such as pixels or meshes due to algorithmic challenges and the lack of large-scale real-world datasets. We address the dataset limitation with DiVA-360, a real-world 360 dynamic visual-audio dataset with synchronized multimodal visual, audio, and textual information about table-scale scenes. It contains 46 dynamic scenes, 30 static scenes, and 95 static objects spanning 11 categories captured using a new hardware system using 53 RGB cameras at 120 FPS and 6 microphones for a total of 8.6M image frames and 1360 s of dynamic data. We provide detailed text descriptions for all scenes, foreground-background segmentation masks, category-specific 3D pose alignment for static objects, as well as metrics for comparison. Our data, hardware and software, and code are available at https://diva360.github.io/.

READ FULL TEXT

page 1

page 4

page 20

page 22

page 23

page 24

page 25

page 26

research
06/16/2023

OCTScenes: A Versatile Real-World Dataset of Tabletop Scenes for Object-Centric Learning

Humans possess the cognitive ability to comprehend scenes in a compositi...
research
09/20/2022

wildNeRF: Complete view synthesis of in-the-wild dynamic scenes captured using sparse monocular data

We present a novel neural radiance model that is trainable in a self-sup...
research
03/25/2023

SUDS: Scalable Urban Dynamic Scenes

We extend neural radiance fields (NeRFs) to dynamic large-scale urban sc...
research
06/09/2023

RePaint-NeRF: NeRF Editting via Semantic Masks and Diffusion Models

The emergence of Neural Radiance Fields (NeRF) has promoted the developm...
research
03/30/2022

Iterative Deep Homography Estimation

We propose Iterative Homography Network, namely IHN, a new deep homograp...
research
07/14/2022

Egocentric Scene Understanding via Multimodal Spatial Rectifier

In this paper, we study a problem of egocentric scene understanding, i.e...
research
01/24/2023

K-Planes: Explicit Radiance Fields in Space, Time, and Appearance

We introduce k-planes, a white-box model for radiance fields in arbitrar...

Please sign up or login with your details

Forgot password? Click here to reset