Replay: Multi-modal Multi-view Acted Videos for Casual Holography

07/22/2023
by Roman Shapovalov, et al.

We introduce Replay, a collection of multi-view, multi-modal videos of humans interacting socially. Each scene is filmed in high production quality, from different viewpoints with several static cameras, as well as wearable action cameras, and recorded with a large array of microphones at different positions in the room. Overall, the dataset contains over 4000 minutes of footage and over 7 million timestamped high-resolution frames annotated with camera poses and partially with foreground masks. The Replay dataset has many potential applications, such as novel-view synthesis, 3D reconstruction, novel-view acoustic synthesis, human body and face analysis, and training generative models. We provide a benchmark for training and evaluating novel-view synthesis, with two scenarios of different difficulty. Finally, we evaluate several baseline state-of-the-art methods on the new benchmark.
