3D-C2FT: Coarse-to-fine Transformer for Multi-view 3D Reconstruction

05/29/2022
by Leslie Ching Ow Tiong, et al.

Recently, the transformer model has been successfully employed for the multi-view 3D reconstruction problem. However, challenges remain in designing an attention mechanism that explores multi-view features and exploits their relations to reinforce the encoding-decoding modules. This paper proposes a new model, the 3D coarse-to-fine transformer (3D-C2FT), which introduces a novel coarse-to-fine (C2F) attention mechanism for encoding multi-view features and rectifying defective 3D objects. The C2F attention mechanism enables the model to learn multi-view information flow and synthesize 3D surface correction, progressing from coarse to fine-grained detail. The proposed model is evaluated on the ShapeNet and Multi-view Real-life datasets. Experimental results show that 3D-C2FT outperforms several competing models on these datasets.
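
The abstract does not spell out how the C2F attention is computed, so the following is only a rough sketch of the general idea of attending over multi-view tokens at progressively finer granularity. Everything in it is an assumption made for illustration and not the authors' method: the function names (`self_attention`, `pool`, `coarse_to_fine`), the use of average pooling to form coarse tokens, the window schedule `(4, 2, 1)`, and the omission of learned query/key/value projections.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Scaled dot-product self-attention over a list of token vectors.

    Assumption: queries, keys, and values are the tokens themselves
    (identity projections); a real transformer learns these projections.
    """
    d = len(tokens[0])
    out = []
    for q in tokens:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in tokens]
        weights = softmax(scores)
        out.append([sum(w * v[j] for w, v in zip(weights, tokens)) for j in range(d)])
    return out


def pool(tokens, window):
    """Average each run of `window` adjacent tokens into one coarser token."""
    pooled = []
    for i in range(0, len(tokens), window):
        group = tokens[i:i + window]
        pooled.append([sum(col) / len(group) for col in zip(*group)])
    return pooled


def coarse_to_fine(tokens, windows=(4, 2, 1)):
    """Run attention at several granularities, coarsest first (hypothetical schedule)."""
    return [self_attention(pool(tokens, w)) for w in windows]


# Eight toy 2-D tokens standing in for features gathered from several views.
view_tokens = [[float(i), float(i % 3)] for i in range(8)]
stages = coarse_to_fine(view_tokens)
print([len(stage) for stage in stages])
```

Each stage sees more tokens than the one before it, which is the only sense in which this sketch is "coarse to fine"; how the actual model combines the stages and feeds them to a decoder is not described in the abstract and is not modelled here.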


research
03/24/2021

Multi-view 3D Reconstruction with Transformer

Deep CNN-based methods have so far achieved the state of the art results...
research
08/23/2023

Reranking Passages with Coarse-to-Fine Neural Retriever using List-Context Information

Passage reranking is a crucial task in many applications, particularly w...
research
09/14/2020

Collaborative Attention Mechanism for Multi-View Action Recognition

Multi-view action recognition (MVAR) leverages complementary temporal in...
research
03/29/2023

Self-accumulative Vision Transformer for Bone Age Assessment Using the Sauvegrain Method

This study presents a novel approach to bone age assessment (BAA) using ...
research
04/19/2023

ASM: Adaptive Skinning Model for High-Quality 3D Face Modeling

The research fields of parametric face models and 3D face reconstruction...
research
02/23/2022

Paying U-Attention to Textures: Multi-Stage Hourglass Vision Transformer for Universal Texture Synthesis

We present a novel U-Attention vision Transformer for universal texture ...
research
06/23/2021

LegoFormer: Transformers for Block-by-Block Multi-view 3D Reconstruction

Most modern deep learning-based multi-view 3D reconstruction techniques ...
