R2-MLP: Round-Roll MLP for Multi-View 3D Object Recognition

11/20/2022
by   Shuo Chen, et al.
0

Recently, vision architectures based exclusively on multi-layer perceptrons (MLPs) have gained much attention in the computer vision community. MLP-like models achieve competitive performance on a single 2D image classification with less inductive bias without hand-crafted convolution layers. In this work, we explore the effectiveness of MLP-based architecture for the view-based 3D object recognition task. We present an MLP-based architecture termed as Round-Roll MLP (R^2-MLP). It extends the spatial-shift MLP backbone by considering the communications between patches from different views. R^2-MLP rolls part of the channels along the view dimension and promotes information exchange between neighboring views. We benchmark MLP results on ModelNet10 and ModelNet40 datasets with ablations in various aspects. The experimental results show that, with a conceptually simple structure, our R^2-MLP achieves competitive performance compared with existing state-of-the-art methods.

READ FULL TEXT
research
10/25/2021

MVT: Multi-view Vision Transformer for 3D Object Recognition

Inspired by the great success achieved by CNN in image recognition, view...
research
08/02/2021

S^2-MLPv2: Improved Spatial-Shift MLP Architecture for Vision

Recently, MLP-based vision backbones emerge. MLP-based vision architectu...
research
06/04/2019

Dominant Set Clustering and Pooling for Multi-View 3D Object Recognition

View based strategies for 3D object recognition have proven to be very s...
research
05/26/2016

Pairwise Decomposition of Image Sequences for Active Multi-View Recognition

A multi-view image sequence provides a much richer capacity for object r...
research
12/02/2007

View Based Methods can achieve Bayes-Optimal 3D Recognition

This paper proves that visual object recognition systems using only 2D E...
research
10/14/2016

Recurrent 3D Attentional Networks for End-to-End Active Object Recognition in Cluttered Scenes

Active vision is inherently attention-driven: The agent selects views of...
research
05/28/2023

Using Caterpillar to Nibble Small-Scale Images

Recently, MLP-based models have become popular and attained significant ...

Please sign up or login with your details

Forgot password? Click here to reset