Real-Time Radiance Fields for Single-Image Portrait View Synthesis

05/03/2023
by   Alex Trevithick, et al.
4

We present a one-shot method to infer and render a photorealistic 3D representation from a single unposed image (e.g., face portrait) in real-time. Given a single RGB input, our image encoder directly predicts a canonical triplane representation of a neural radiance field for 3D-aware novel view synthesis via volume rendering. Our method is fast (24 fps) on consumer hardware, and produces higher quality results than strong GAN-inversion baselines that require test-time optimization. To train our triplane encoder pipeline, we use only synthetic data, showing how to distill the knowledge from a pretrained 3D GAN into a feedforward encoder. Technical contributions include a Vision Transformer-based triplane encoder, a camera data augmentation strategy, and a well-designed loss function for synthetic data training. We benchmark against the state-of-the-art methods, demonstrating significant improvements in robustness and image quality in challenging real-world settings. We showcase our results on portraits of faces (FFHQ) and cats (AFHQ), but our algorithm can also be applied in the future to other categories with a 3D-aware image generator.

READ FULL TEXT

page 7

page 9

page 10

page 11

page 17

page 18

page 20

page 21

research
02/26/2022

Pix2NeRF: Unsupervised Conditional π-GAN for Single Image to Neural Radiance Fields Translation

We propose a pipeline to generate Neural Radiance Fields (NeRF) of an ob...
research
02/05/2021

Unsupervised Novel View Synthesis from a Single Image

Novel view synthesis from a single image aims at generating novel views ...
research
07/12/2022

Vision Transformer for NeRF-Based View Synthesis from a Single Input Image

Although neural radiance fields (NeRF) have shown impressive advances fo...
research
11/12/2022

3D-Aware Encoding for Style-based Neural Radiance Fields

We tackle the task of NeRF inversion for style-based neural radiance fie...
research
12/01/2021

HyperInverter: Improving StyleGAN Inversion via Hypernetwork

Real-world image manipulation has achieved fantastic progress in recent ...
research
03/23/2023

TriPlaneNet: An Encoder for EG3D Inversion

Recent progress in NeRF-based GANs has introduced a number of approaches...
research
12/20/2021

3D-aware Image Synthesis via Learning Structural and Textural Representations

Making generative models 3D-aware bridges the 2D image space and the 3D ...

Please sign up or login with your details

Forgot password? Click here to reset