AR-NeRF: Unsupervised Learning of Depth and Defocus Effects from Natural Images with Aperture Rendering Neural Radiance Fields

06/13/2022
by Takuhiro Kaneko, et al.

Fully unsupervised 3D representation learning has gained attention owing to its advantages in data collection. A successful line of work is viewpoint-aware: it learns an image distribution with a generative model (e.g., a generative adversarial network (GAN)) while generating images from various views with a 3D-aware model (e.g., a neural radiance field (NeRF)). However, such methods require training images with various views, so applying them to datasets with few or limited viewpoints remains a challenge. As a complementary approach, an aperture rendering GAN (AR-GAN) that employs a defocus cue was proposed. However, an AR-GAN is a CNN-based model and represents defocus independently of viewpoint change, even though the two are highly correlated, which is one of the reasons its performance is limited. As an alternative to an AR-GAN, we propose an aperture rendering NeRF (AR-NeRF), which can utilize viewpoint and defocus cues in a unified manner by representing both factors in a common ray-tracing framework. Moreover, to learn defocus-aware and defocus-independent representations in a disentangled manner, we propose aperture randomized training, in which we learn to generate images while randomizing the aperture size and latent codes independently. In our experiments, we applied AR-NeRF to various natural image datasets, including flower, bird, and face images, and the results demonstrate the utility of AR-NeRF for unsupervised learning of depth and defocus effects.
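
The abstract states only that defocus is represented in a ray-tracing framework and that the aperture size and latent codes are randomized independently during training. The sketch below illustrates those two ideas with a standard thin-lens ray construction from computer graphics. It is not the authors' implementation. The function names, the Gaussian prior on the latent code, the uniform aperture range, and the camera convention (looking along +z, focus plane at z = `focus_distance`) are all assumptions made for illustration.

```python
import math
import random


def sample_training_inputs(rng, latent_dim=8, max_aperture=0.1):
    """Draw a latent code and an aperture size independently of each other.

    Assumptions (not from the paper): standard Gaussian latent prior and a
    uniform aperture range [0, max_aperture].
    """
    z = [rng.gauss(0.0, 1.0) for _ in range(latent_dim)]
    s = rng.uniform(0.0, max_aperture)
    return z, s


def aperture_rays(origin, direction, focus_distance, aperture, n_rays, rng):
    """Thin-lens style rays for one pixel (hypothetical helper).

    Ray origins are spread over a disc of radius `aperture` around `origin`
    in the lens plane, and every ray passes through the point where the
    central ray meets the focus plane z = focus_distance. Requires
    direction[2] > 0. With aperture == 0 this reduces to a pinhole ray.
    """
    ox, oy, oz = origin
    dx, dy, dz = direction
    # Point where the central (pinhole) ray crosses the focus plane.
    t = (focus_distance - oz) / dz
    focus = (ox + t * dx, oy + t * dy, oz + t * dz)
    rays = []
    for _ in range(n_rays):
        # Uniform sample on a disc: sqrt keeps the area density constant.
        r = aperture * math.sqrt(rng.random())
        theta = 2.0 * math.pi * rng.random()
        o = (ox + r * math.cos(theta), oy + r * math.sin(theta), oz)
        v = tuple(f - c for f, c in zip(focus, o))
        norm = math.sqrt(sum(c * c for c in v))
        rays.append((o, tuple(c / norm for c in v)))
    return rays


# Example: one training draw, then the rays for a pixel on the optical axis.
rng = random.Random(0)
z, s = sample_training_inputs(rng)
rays = aperture_rays((0.0, 0.0, 0.0), (0.0, 0.0, 1.0),
                     focus_distance=2.0, aperture=s, n_rays=4, rng=rng)
```

In this construction, points on the focus plane are hit by every ray of a pixel and stay sharp, while points nearer or farther are hit by rays that have spread apart, which produces blur that grows with the aperture size. Drawing `s` separately from `z` means the same latent code can be paired with any aperture, which is the kind of independence the abstract describes.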


research
06/24/2021

Unsupervised Learning of Depth and Depth-of-Field Effect from Natural Images with Aperture Rendering Generative Adversarial Networks

Understanding the 3D world from 2D projected natural images is a fundame...
research
04/02/2019

HoloGAN: Unsupervised learning of 3D representations from natural images

We propose a novel generative adversarial network (GAN) for the task of ...
research
03/22/2023

NeRF-GAN Distillation for Memory-Efficient 3D-Aware Generation with Convolutions

Pose-conditioned convolutional generative models struggle with high-qual...
research
10/01/2019

Unsupervised Generative 3D Shape Learning from Natural Images

In this paper we present, to the best of our knowledge, the first method...
research
04/02/2018

Generative Spatiotemporal Modeling Of Neutrophil Behavior

Cell motion and appearance have a strong correlation with cell cycle and...
research
01/27/2023

HyperNeRFGAN: Hypernetwork approach to 3D NeRF GAN

Recently, generative models for 3D objects are gaining much popularity i...
research
04/16/2023

Likelihood-Based Generative Radiance Field with Latent Space Energy-Based Model for 3D-Aware Disentangled Image Representation

We propose the NeRF-LEBM, a likelihood-based top-down 3D-aware 2D image ...
