Personalize Segment Anything Model with One Shot

05/04/2023
by Renrui Zhang, et al.

Driven by large-data pre-training, the Segment Anything Model (SAM) has proven to be a powerful, promptable framework that has revolutionized segmentation models. Despite its generality, customizing SAM for specific visual concepts without manual prompting remains underexplored, e.g., automatically segmenting your pet dog in different images. In this paper, we propose a training-free personalization approach for SAM, termed PerSAM. Given only a single image with a reference mask, PerSAM first localizes the target concept with a location prior, then segments it in other images or videos via three techniques: target-guided attention, target-semantic prompting, and cascaded post-refinement. In this way, we effectively adapt SAM for private use without any training. To further alleviate mask ambiguity, we present an efficient one-shot fine-tuning variant, PerSAM-F. Freezing the entire SAM, we introduce two learnable weights for multi-scale masks, training only 2 parameters within 10 seconds for improved performance. To demonstrate the efficacy of our approach, we construct a new segmentation dataset, PerSeg, for personalized evaluation, and test our methods on video object segmentation with competitive performance. In addition, our approach can enhance DreamBooth to personalize Stable Diffusion for text-to-image generation by discarding background disturbance for better target appearance learning. Code is released at https://github.com/ZrrSkywalker/Personalize-SAM
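
The abstract names two mechanisms that a small sketch may help make concrete: a location prior that localizes the target, and two learnable weights that blend multi-scale masks. The NumPy code below is only an illustration under stated assumptions, not the released implementation. It assumes the location prior is a cosine-similarity map between one target embedding and per-location image features, and that three mask scales are blended with weights `w1`, `w2`, and `1 - w1 - w2`. The function names `location_prior` and `combine_masks` are hypothetical.

```python
import numpy as np


def location_prior(target_feat, image_feats, eps=1e-8):
    """Assumed form of a location prior: cosine similarity between a
    target embedding of shape (C,) and a feature map of shape (H, W, C).

    Returns the (H, W) similarity map and the (row, col) of its maximum,
    which could serve as a positive point prompt.
    """
    t = target_feat / (np.linalg.norm(target_feat) + eps)
    norms = np.linalg.norm(image_feats, axis=-1, keepdims=True)
    f = image_feats / (norms + eps)
    sim = f @ t  # dot product over the channel axis, shape (H, W)
    row, col = np.unravel_index(np.argmax(sim), sim.shape)
    return sim, (int(row), int(col))


def combine_masks(mask_logits, w1, w2):
    """Assumed blending of three mask scales with two free weights.

    The third weight is fixed to 1 - w1 - w2, so only w1 and w2
    would need to be learned.
    """
    m1, m2, m3 = mask_logits
    return w1 * m1 + w2 * m2 + (1.0 - w1 - w2) * m3


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feats = rng.normal(size=(4, 4, 8))   # toy feature map
    target = feats[2, 1].copy()          # pretend this is the target embedding
    sim, peak = location_prior(target, feats)
    print("most similar location:", peak)

    masks = [rng.normal(size=(4, 4)) for _ in range(3)]
    blended = combine_masks(masks, w1=1 / 3, w2=1 / 3)
    print("binary mask:\n", (blended > 0).astype(int))
```

With `w1 = w2 = 1/3` the blend reduces to a plain average of the three scales, which is a natural starting point before any fine-tuning of the two weights.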

research
07/03/2023

Segment Anything Meets Point Tracking

The Segment Anything Model (SAM) has established itself as a powerful ze...
research
06/30/2023

Training-free Object Counting with Prompts

This paper tackles the problem of object counting in images. Existing ap...
research
04/24/2023

Segment Anything in 3D with NeRFs

The Segment Anything Model (SAM) has demonstrated its effectiveness in s...
research
03/10/2023

Importance of Aligning Training Strategy with Evaluation for Diffusion Models in 3D Multiclass Segmentation

Recently, denoising diffusion probabilistic models (DDPM) have been appl...
research
08/23/2023

Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion

Producing quality segmentation masks for images is a fundamental problem...
research
09/06/2023

SLiMe: Segment Like Me

Significant strides have been made using large vision-language models, l...
research
06/25/2023

Faster Segment Anything: Towards Lightweight SAM for Mobile Applications

Segment anything model (SAM) is a prompt-guided vision foundation model ...
