EpiGRAF: Rethinking training of 3D GANs

06/21/2022
by Ivan Skorokhodov, et al.

A very recent trend in generative modeling is building 3D-aware generators from 2D image collections. To induce the 3D bias, such models typically rely on volumetric rendering, which is expensive to employ at high resolutions. Over the past months, more than 10 works have appeared that address this scaling issue by training a separate 2D decoder to upsample a low-resolution image (or a feature tensor) produced by a pure 3D generator. But this solution comes at a cost: not only does it break multi-view consistency (i.e., shape and texture change when the camera moves), but it also learns the geometry at low fidelity. In this work, we show that it is possible to obtain a high-resolution 3D generator with SotA image quality by following a completely different route: simply training the model patch-wise. We revisit and improve this optimization scheme in two ways. First, we design a location- and scale-aware discriminator that works on patches of different proportions and spatial positions. Second, we modify the patch sampling strategy based on an annealed beta distribution to stabilize training and accelerate convergence. The resulting model, named EpiGRAF, is an efficient, high-resolution, pure 3D generator, and we test it on four datasets (two introduced in this work) at 256^2 and 512^2 resolutions. It obtains state-of-the-art image quality and high-fidelity geometry, and trains ≈2.5× faster than its upsampler-based counterparts. Project website: https://universome.github.io/epigraf.
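
The patch sampling idea mentioned in the abstract can be illustrated with a rough sketch. The abstract only says that patch sampling is based on an annealed beta distribution; everything else here is an assumption made for illustration: the Beta(1, b) parameterization, the linear schedule for b, the `min_scale` bound, and the function names `beta_schedule` and `sample_patch` are hypothetical and not taken from the paper. With a small b the samples concentrate near full-image patches, and as b grows to 1 the scale becomes uniform, so smaller patches appear more often later in training.

```python
import random


def beta_schedule(step, anneal_steps, b_start=0.001, b_end=1.0):
    """Linearly anneal the second Beta parameter (hypothetical schedule)."""
    t = min(max(step / anneal_steps, 0.0), 1.0)
    return b_start + t * (b_end - b_start)


def sample_patch(step, anneal_steps, min_scale=0.25, rng=random):
    """Sample a square patch as (scale, (x0, y0)) in unit image coordinates.

    scale is the patch side length relative to the full image.
    Assumption: x ~ Beta(1, b). A small b pushes x towards 1 (large patches);
    b = 1 makes x uniform on [0, 1].
    """
    b = beta_schedule(step, anneal_steps)
    x = rng.betavariate(1.0, b)
    scale = min_scale + (1.0 - min_scale) * x
    # Uniform top-left corner such that the patch stays inside the image.
    max_offset = max(0.0, 1.0 - scale)
    x0 = rng.uniform(0.0, max_offset)
    y0 = rng.uniform(0.0, max_offset)
    return scale, (x0, y0)
```

In such a setup, the sampled scale and offset would also be the natural quantities to pass to a location- and scale-aware discriminator, though how the paper conditions its discriminator is not stated in the abstract.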


