ViewFormer: View Set Attention for Multi-view 3D Shape Understanding

04/29/2023
by   Hongyu Sun, et al.

This paper presents ViewFormer, a simple yet effective model for multi-view 3D shape recognition and retrieval. We systematically investigate existing methods for aggregating multi-view information and propose a novel “view set” perspective, which minimizes assumptions about the relations among views and frees up representational flexibility. We devise an adaptive attention model to capture pairwise and higher-order correlations among the elements of the view set. The learned multi-view correlations are aggregated into an expressive view set descriptor for recognition and retrieval. Experiments show that the proposed method performs strongly across different tasks and datasets. For instance, with only 2 attention blocks and 4.8M learnable parameters, ViewFormer reaches 98.8% [...], exceeding the previous best method by 1.1%. [...] method achieves 98.4% [...] improvement over the strongest baseline. ViewFormer also sets new records in several evaluation dimensions of 3D shape retrieval defined on the SHREC'17 benchmark.
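
The "view set" idea can be illustrated with a minimal sketch; it is not the authors' implementation. It assumes single-head self-attention with no positional encoding, so that views are treated as an unordered set, and it assumes the descriptor is formed by concatenating max- and mean-pooled features. All names here (`attention_block`, `view_set_descriptor`, `random_matrix`) and the random weights are hypothetical, chosen only for illustration.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    peak = max(row)
    exps = [math.exp(value - peak) for value in row]
    total = sum(exps)
    return [e / total for e in exps]


def random_matrix(rows, cols, rng):
    """Hypothetical stand-in for learned weights or extracted view features."""
    return [[rng.gauss(0.0, 1.0 / math.sqrt(rows)) for _ in range(cols)]
            for _ in range(rows)]


def attention_block(views, wq, wk, wv):
    """Single-head self-attention with a residual connection.

    Assumption: no positional encoding is added, so the block does not
    depend on any ordering of the views.
    """
    q, k, v = matmul(views, wq), matmul(views, wk), matmul(views, wv)
    scale = math.sqrt(len(wq[0]))
    k_transposed = [list(col) for col in zip(*k)]
    scores = matmul(q, k_transposed)  # pairwise view-to-view scores
    weights = [softmax([s / scale for s in row]) for row in scores]
    attended = matmul(weights, v)
    return [[x + y for x, y in zip(r1, r2)] for r1, r2 in zip(views, attended)]


def view_set_descriptor(views, blocks):
    """Apply the attention blocks, then pool over views.

    Assumption: the descriptor is max-pooled features concatenated with
    mean-pooled features; both poolings ignore view order.
    """
    for wq, wk, wv in blocks:
        views = attention_block(views, wq, wk, wv)
    columns = list(zip(*views))
    return [max(c) for c in columns] + [sum(c) / len(c) for c in columns]


rng = random.Random(0)
dim = 4
views = random_matrix(6, dim, rng)  # 6 views, each a 4-dim feature vector
blocks = [tuple(random_matrix(dim, dim, rng) for _ in range(3)) for _ in range(2)]
descriptor = view_set_descriptor(views, blocks)

shuffled = views[:]
rng.shuffle(shuffled)
shuffled_descriptor = view_set_descriptor(shuffled, blocks)
```

In this sketch, the first block mixes views pairwise, and the second block attends over features that already combine several views, which is one way to read "higher-order correlations". Shuffling the input views leaves the descriptor unchanged up to floating-point rounding, which is the property the set perspective relies on.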

research
08/27/2019

HRGE-Net: Hierarchical Relational Graph Embedding Network for Multi-view 3D Shape Recognition

View-based approach that recognizes 3D shape through its projected 2D im...
research
01/28/2022

Higher Order Correlation Analysis for Multi-View Learning

Multi-view learning is frequently used in data science. The pairwise cor...
research
08/16/2021

Learning Canonical View Representation for 3D Shape Recognition with Arbitrary Views

In this paper, we focus on recognizing 3D shapes from arbitrary views, i...
research
08/06/2019

View N-gram Network for 3D Object Retrieval

How to aggregate multi-view representations of a 3D shape object into an...
research
02/28/2020

MANet: Multimodal Attention Network based Point-View fusion for 3D Shape Recognition

3D shape recognition has attracted more and more attention as a task of ...
research
04/29/2020

A generalized kernel machine approach to identify higher-order composite effects in multi-view datasets

In recent years, a comprehensive study of multi-view datasets (e.g., mul...
research
03/29/2023

ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance

Understanding 3D scenes from multi-view inputs has been proven to allevi...
