3D generation on ImageNet

03/02/2023
by Ivan Skorokhodov, et al.

Existing 3D-from-2D generators are typically designed for well-curated single-category datasets, where all the objects have (approximately) the same scale, 3D location, and orientation, and the camera always points to the center of the scene. This makes them inapplicable to diverse, in-the-wild datasets of non-alignable scenes rendered from arbitrary camera poses. In this work, we develop a 3D generator with Generic Priors (3DGP): a 3D synthesis framework with more general assumptions about the training data, and show that it scales to very challenging datasets, like ImageNet. Our model is based on three new ideas. First, we incorporate an inaccurate off-the-shelf depth estimator into 3D GAN training via a special depth adaptation module to handle the imprecision. Then, we create a flexible camera model and a regularization strategy for it to learn its distribution parameters during training. Finally, we extend the recent ideas of transferring knowledge from pre-trained classifiers into GANs for patch-wise trained models by employing a simple distillation-based technique on top of the discriminator. It achieves more stable training than the existing methods and speeds up the convergence by at least 40%. We explore our model on four datasets: SDIP Dogs 256x256, SDIP Elephants 256x256, LSUN Horses 256x256, and ImageNet 256x256, and demonstrate that 3DGP outperforms the recent state-of-the-art in terms of both texture and geometry quality. Code and visualizations: https://snap-research.github.io/3dgp.
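
The abstract does not spell out the distillation objective, so the following is only a rough sketch of one common way such a term could look: the discriminator gets an extra head that predicts the features a frozen pre-trained classifier assigns to the same image, and the mismatch is penalized with a mean squared error. The function names, the choice of an L2 penalty, and the `weight` parameter are all assumptions made for illustration, not details taken from the paper.

```python
from typing import Sequence


def distillation_loss(student_feats: Sequence[Sequence[float]],
                      teacher_feats: Sequence[Sequence[float]]) -> float:
    """Mean squared error between predicted and teacher features.

    student_feats: features predicted by a (hypothetical) discriminator head.
    teacher_feats: features from a frozen pre-trained classifier (assumed).
    Both are batches of equal-length feature vectors.
    """
    if len(student_feats) != len(teacher_feats):
        raise ValueError("batch sizes differ")
    total, count = 0.0, 0
    for s, t in zip(student_feats, teacher_feats):
        if len(s) != len(t):
            raise ValueError("feature dimensions differ")
        for a, b in zip(s, t):
            total += (a - b) ** 2
            count += 1
    return total / count if count else 0.0


def discriminator_loss(adv_loss: float,
                       student_feats: Sequence[Sequence[float]],
                       teacher_feats: Sequence[Sequence[float]],
                       weight: float = 1.0) -> float:
    """Adversarial loss plus a weighted distillation term (assumed form)."""
    return adv_loss + weight * distillation_loss(student_feats, teacher_feats)
```

In this sketch the teacher only supplies regression targets for the discriminator, so no gradients would need to flow through the classifier itself; whether 3DGP is set up the same way should be checked against the full text.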


research
01/06/2023

3DAvatarGAN: Bridging Domains for Personalized Editable Avatars

Modern 3D-GANs synthesize geometry and texture by training on large-scal...
research
03/30/2023

KD-DLGAN: Data Limited Image Generation via Knowledge Distillation

Generative Adversarial Networks (GANs) rely heavily on large-scale train...
research
06/22/2023

Squeeze, Recover and Relabel: Dataset Condensation at ImageNet Scale From A New Perspective

We present a new dataset condensation framework termed Squeeze, Recover ...
research
12/14/2022

NoPe-NeRF: Optimising Neural Radiance Field with No Pose Prior

Training a Neural Radiance Field (NeRF) without pre-computed camera pose...
research
06/21/2022

EpiGRAF: Rethinking training of 3D GANs

A very recent trend in generative modeling is building 3D-aware generato...
research
02/05/2019

Perturbative GAN: GAN with Perturbation Layers

Perturbative GAN, which replaces convolution layers of existing convolut...
research
02/01/2022

StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets

Computer graphics has experienced a recent surge of data-centric approac...
