IRISformer: Dense Vision Transformers for Single-Image Inverse Rendering in Indoor Scenes

06/16/2022
by   Rui Zhu, et al.
0

Indoor scenes exhibit significant appearance variations due to myriad interactions between arbitrarily diverse object shapes, spatially-changing materials, and complex lighting. Shadows, highlights, and inter-reflections caused by visible and invisible light sources require reasoning about long-range interactions for inverse rendering, which seeks to recover the components of image formation, namely, shape, material, and lighting. In this work, our intuition is that the long-range attention learned by transformer architectures is ideally suited to solve longstanding challenges in single-image inverse rendering. We demonstrate with a specific instantiation of a dense vision transformer, IRISformer, that excels at both single-task and multi-task reasoning required for inverse rendering. Specifically, we propose a transformer architecture to simultaneously estimate depths, normals, spatially-varying albedo, roughness and lighting from a single image of an indoor scene. Our extensive evaluations on benchmark datasets demonstrate state-of-the-art results on each of the above tasks, enabling applications like object insertion and material editing in a single unconstrained real image, with greater photorealism than prior works. Code and data are publicly released at https://github.com/ViLab-UCSD/IRISformer.

READ FULL TEXT

page 6

page 8

page 10

page 11

page 12

page 13

page 14

page 16

research
05/07/2019

Inverse Rendering for Complex Indoor Scenes: Shape, Spatially-Varying Lighting and SVBRDF from a Single Image

We propose a deep inverse rendering framework for indoor scenes. From a ...
research
09/13/2021

Learning Indoor Inverse Rendering with 3D Spatially-Varying Lighting

In this work, we address the problem of jointly estimating albedo, norma...
research
01/08/2019

Neural Inverse Rendering of an Indoor Scene from a Single Image

Inverse rendering aims to estimate physical scene attributes (e.g., refl...
research
04/12/2023

Factorized Inverse Path Tracing for Efficient and Accurate Material-Lighting Estimation

Inverse path tracing has recently been applied to joint material and lig...
research
03/24/2023

Weakly-supervised Single-view Image Relighting

We present a learning-based approach to relight a single image of Lamber...
research
03/01/2021

Generative Adversarial Transformers

We introduce the GANsformer, a novel and efficient type of transformer, ...
research
07/06/2023

PSDR-Room: Single Photo to Scene using Differentiable Rendering

A 3D digital scene contains many components: lights, materials and geome...

Please sign up or login with your details

Forgot password? Click here to reset