AutoDecoding Latent 3D Diffusion Models

07/07/2023
by   Evangelos Ntavelis, et al.
0

We present a novel approach to the generation of static and articulated 3D assets that has a 3D autodecoder at its core. The 3D autodecoder framework embeds properties learned from the target dataset in the latent space, which can then be decoded into a volumetric representation for rendering view-consistent appearance and geometry. We then identify the appropriate intermediate volumetric latent space, and introduce robust normalization and de-normalization operations to learn a 3D diffusion from 2D images or monocular videos of rigid or articulated objects. Our approach is flexible enough to use either existing camera supervision or no camera information at all – instead efficiently learning it during training. Our evaluations demonstrate that our generation results outperform state-of-the-art alternatives on various benchmark datasets and metrics, including multi-view image datasets of synthetic objects, real in-the-wild videos of moving people, and a large-scale, real video dataset of static objects.

READ FULL TEXT

page 7

page 9

page 16

page 17

research
07/28/2022

Neural Strands: Learning Hair Geometry and Appearance from Multi-View Images

We present Neural Strands, a novel learning framework for modeling accur...
research
08/31/2023

MVDream: Multi-view Diffusion for 3D Generation

We propose MVDream, a multi-view diffusion model that is able to generat...
research
03/29/2022

Disentangled3D: Learning a 3D Generative Model with Disentangled Geometry and Appearance from Monocular Images

Learning 3D generative models from a dataset of monocular images enables...
research
06/14/2021

Flow Guided Transformable Bottleneck Networks for Motion Retargeting

Human motion retargeting aims to transfer the motion of one person in a ...
research
04/25/2019

Learning the Depths of Moving People by Watching Frozen People

We present a method for predicting dense depth in scenarios where both a...
research
03/03/2022

Playable Environments: Video Manipulation in Space and Time

We present Playable Environments - a new representation for interactive ...
research
05/10/2023

Reconstructing Animatable Categories from Videos

Building animatable 3D models is challenging due to the need for 3D scan...

Please sign up or login with your details

Forgot password? Click here to reset