Physically Disentangled Representations

04/11/2022
by   Tzofi Klinghoffer, et al.
1

State-of-the-art methods in generative representation learning yield semantic disentanglement, but typically do not consider physical scene parameters, such as geometry, albedo, lighting, or camera. We posit that inverse rendering, a way to reverse the rendering process to recover scene parameters from an image, can also be used to learn physically disentangled representations of scenes without supervision. In this paper, we show the utility of inverse rendering in learning representations that yield improved accuracy on downstream clustering, linear classification, and segmentation tasks with the help of our novel Leave-One-Out, Cycle Contrastive loss (LOOCC), which improves disentanglement of scene parameters and robustness to out-of-distribution lighting and viewpoints. We perform a comparison of our method with other generative representation learning methods across a variety of downstream tasks, including face attribute classification, emotion recognition, identification, face segmentation, and car classification. Our physically disentangled representations yield higher accuracy than semantically disentangled alternatives across all tasks and by as much as 18 will motivate future research in applying advances in inverse rendering and 3D understanding to representation learning.

READ FULL TEXT
research
08/14/2021

Unsupervised Disentanglement without Autoencoding: Pitfalls and Future Directions

Disentangled visual representations have largely been studied with gener...
research
10/23/2021

Group-disentangled Representation Learning with Weakly-Supervised Regularization

Learning interpretable and human-controllable representations that uncov...
research
11/18/2022

Multi-view Inverse Rendering for Large-scale Real-world Indoor Scenes

We present a multi-view inverse rendering method for large-scale real-wo...
research
01/10/2023

Neural Radiance Field Codebooks

Compositional representations of the world are a promising step towards ...
research
02/10/2022

Measuring disentangled generative spatio-temporal representation

Disentangled representation learning offers useful properties such as di...
research
06/15/2023

UrbanIR: Large-Scale Urban Scene Inverse Rendering from a Single Video

We show how to build a model that allows realistic, free-viewpoint rende...
research
05/31/2019

On the Fairness of Disentangled Representations

Recently there has been a significant interest in learning disentangled ...

Please sign up or login with your details

Forgot password? Click here to reset