SAM3D: Zero-Shot 3D Object Detection via Segment Anything Model

06/04/2023
by   Dingyuan Zhang, et al.
0

With the development of large language models, many remarkable linguistic systems like ChatGPT have thrived and achieved astonishing success on many tasks, showing the incredible power of foundation models. In the spirit of unleashing the capability of foundation models on vision tasks, the Segment Anything Model (SAM), a vision foundation model for image segmentation, has been proposed recently and presents strong zero-shot ability on many downstream 2D tasks. However, whether SAM can be adapted to 3D vision tasks has yet to be explored, especially 3D object detection. With this inspiration, we explore adapting the zero-shot ability of SAM to 3D object detection in this paper. We propose a SAM-powered BEV processing pipeline to detect objects and get promising results on the large-scale Waymo open dataset. As an early attempt, our method takes a step toward 3D object detection with vision foundation models and presents the opportunity to unleash their power on 3D vision tasks. The code is released at https://github.com/DYZhang09/SAM3D.

READ FULL TEXT

page 1

page 3

page 5

research
11/22/2021

Florence: A New Foundation Model for Computer Vision

Automated visual understanding of our diverse and open world demands com...
research
06/30/2023

Zero-shot Nuclei Detection via Visual-Language Pre-trained Models

Large-scale visual-language pre-trained models (VLPM) have proven their ...
research
01/26/2023

Vision-Language Models Performing Zero-Shot Tasks Exhibit Gender-based Disparities

We explore the extent to which zero-shot vision-language models exhibit ...
research
08/12/2023

TongueSAM: An Universal Tongue Segmentation Model Based on SAM with Zero-Shot

Tongue segmentation serves as the primary step in automated TCM tongue d...
research
11/17/2022

InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges

In this report, we present our champion solutions to five tracks at Ego4...
research
04/29/2023

Segment Anything Model (SAM) Meets Glass: Mirror and Transparent Objects Cannot Be Easily Detected

Meta AI Research has recently released SAM (Segment Anything Model) whic...
research
05/17/2023

SAM for Poultry Science

In recent years, the agricultural industry has witnessed significant adv...

Please sign up or login with your details

Forgot password? Click here to reset