LegoFormer: Transformers for Block-by-Block Multi-view 3D Reconstruction

06/23/2021
by   Farid Yagubbayli, et al.
0

Most modern deep learning-based multi-view 3D reconstruction techniques use RNNs or fusion modules to combine information from multiple images after encoding them. These two separate steps have loose connections and do not consider all available information while encoding each view. We propose LegoFormer, a transformer-based model that unifies object reconstruction under a single framework and parametrizes the reconstructed occupancy grid by its decomposition factors. This reformulation allows the prediction of an object as a set of independent structures then aggregated to obtain the final reconstruction. Experiments conducted on ShapeNet display the competitive performance of our network with respect to the state-of-the-art methods. We also demonstrate how the use of self-attention leads to increased interpretability of the model output.

READ FULL TEXT

page 7

page 17

page 18

research
03/24/2021

Multi-view 3D Reconstruction with Transformer

Deep CNN-based methods have so far achieved the state of the art results...
research
04/08/2022

From 2D Images to 3D Model:Weakly Supervised Multi-View Face Reconstruction with Deep Fusion

We consider the problem of Multi-view 3D Face Reconstruction (MVR) with ...
research
12/01/2021

VoRTX: Volumetric 3D Reconstruction With Transformers for Voxelwise View Selection and Fusion

Recent volumetric 3D reconstruction methods can produce very accurate re...
research
01/01/2022

Self-attention Multi-view Representation Learning with Diversity-promoting Complementarity

Multi-view learning attempts to generate a model with a better performan...
research
05/29/2022

3D-C2FT: Coarse-to-fine Transformer for Multi-view 3D Reconstruction

Recently, the transformer model has been successfully employed for the m...
research
01/31/2019

Pix2Vox: Context-aware 3D Reconstruction from Single and Multi-view Images

Recovering the 3D representation of an object from single-view or multi-...
research
12/19/2020

The importance of silhouette optimization in 3D shape reconstruction system from multiple object scenes

This paper presents a multi stage 3D shape reconstruction system of mult...

Please sign up or login with your details

Forgot password? Click here to reset