HoloDiffusion: Training a 3D Diffusion Model using 2D Images

03/29/2023
by   Animesh Karnewar, et al.
5

Diffusion models have emerged as the best approach for generative modeling of 2D images. Part of their success is due to the possibility of training them on millions if not billions of images with a stable learning objective. However, extending these models to 3D remains difficult for two reasons. First, finding a large quantity of 3D training data is much more complex than for 2D images. Second, while it is conceptually trivial to extend the models to operate on 3D rather than 2D grids, the associated cubic growth in memory and compute complexity makes this infeasible. We address the first challenge by introducing a new diffusion setup that can be trained, end-to-end, with only posed 2D images for supervision; and the second challenge by proposing an image formation model that decouples model memory from spatial memory. We evaluate our method on real-world data, using the CO3D dataset which has not been used to train 3D generative models before. We show that our diffusion models are scalable, train robustly, and are competitive in terms of sample quality and fidelity to existing approaches for 3D generative modeling.

READ FULL TEXT

page 1

page 3

page 6

page 7

research
01/30/2023

Extracting Training Data from Diffusion Models

Image diffusion models such as DALL-E 2, Imagen, and Stable Diffusion ha...
research
06/22/2023

Towards More Realistic Membership Inference Attacks on Large Diffusion Models

Generative diffusion models, including Stable Diffusion and Midjourney, ...
research
03/17/2023

On the De-duplication of LAION-2B

Generative models, such as DALL-E, Midjourney, and Stable Diffusion, hav...
research
12/11/2022

DiffAlign : Few-shot learning using diffusion based synthesis and alignment

We address the problem of few-shot classification where the goal is to l...
research
08/31/2023

Detecting Out-of-Context Image-Caption Pairs in News: A Counter-Intuitive Method

The growth of misinformation and re-contextualized media in social media...
research
03/25/2023

Better Aligning Text-to-Image Models with Human Preference

Recent years have witnessed a rapid growth of deep generative models, wi...
research
04/05/2018

The Kanerva Machine: A Generative Distributed Memory

We present an end-to-end trained memory system that quickly adapts to ne...

Please sign up or login with your details

Forgot password? Click here to reset