Total-Recon: Deformable Scene Reconstruction for Embodied View Synthesis

04/24/2023
by   Chonghyuk Song, et al.
0

We explore the task of embodied view synthesis from monocular videos of deformable scenes. Given a minute-long RGBD video of people interacting with their pets, we render the scene from novel camera trajectories derived from in-scene motion of actors: (1) egocentric cameras that simulate the point of view of a target actor and (2) 3rd-person cameras that follow the actor. Building such a system requires reconstructing the root-body and articulated motion of each actor in the scene, as well as a scene representation that supports free-viewpoint synthesis. Longer videos are more likely to capture the scene from diverse viewpoints (which helps reconstruction) but are also more likely to contain larger motions (which complicates reconstruction). To address these challenges, we present Total-Recon, the first method to photorealistically reconstruct deformable scenes from long monocular RGBD videos. Crucially, to scale to long videos, our method hierarchically decomposes the scene motion into the motion of each object, which itself is decomposed into global root-body motion and local articulations. To quantify such "in-the-wild" reconstruction and view synthesis, we collect ground-truth data from a specialized stereo RGBD capture rig for 11 challenging videos, significantly outperforming prior art. Code, videos, and data can be found at https://andrewsonga.github.io/totalrecon .

READ FULL TEXT

page 6

page 7

page 18

page 19

page 20

page 21

page 22

page 23

research
12/04/2018

Monocular Total Capture: Posing Face, Body, and Hands in the Wild

We present the first method to capture the 3D total motion of a target p...
research
11/26/2020

4D Human Body Capture from Egocentric Video via 3D Scene Grounding

To understand human daily social interaction from egocentric perspective...
research
06/14/2022

3D scene reconstruction from monocular spherical video with motion parallax

In this paper, we describe a method to capture nearly entirely spherical...
research
12/23/2020

Vid2Actor: Free-viewpoint Animatable Person Synthesis from Video in the Wild

Given an "in-the-wild" video of a person, we reconstruct an animatable m...
research
07/31/2023

Onboard View Planning of a Flying Camera for High Fidelity 3D Reconstruction of a Moving Actor

Capturing and reconstructing a human actor's motion is important for fil...
research
04/21/2023

Factored Neural Representation for Scene Understanding

A long-standing goal in scene understanding is to obtain interpretable a...
research
08/27/2020

Reducing Drift in Structure from Motion using Extended Features

Low-frequency long-range errors (drift) are an endemic problem in 3D str...

Please sign up or login with your details

Forgot password? Click here to reset