Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion

08/23/2023
by   Junjiao Tian, et al.
0

Producing quality segmentation masks for images is a fundamental problem in computer vision. Recent research has explored large-scale supervised training to enable zero-shot segmentation on virtually any image style and unsupervised training to enable segmentation without dense annotations. However, constructing a model capable of segmenting anything in a zero-shot manner without any annotations is still challenging. In this paper, we propose to utilize the self-attention layers in stable diffusion models to achieve this goal because the pre-trained stable diffusion model has learned inherent concepts of objects within its attention layers. Specifically, we introduce a simple yet effective iterative merging process based on measuring KL divergence among attention maps to merge them into valid segmentation masks. The proposed method does not require any training or language dependency to extract quality segmentation for any images. On COCO-Stuff-27, our method surpasses the prior unsupervised zero-shot SOTA method by an absolute 26 in mean IoU.

READ FULL TEXT

page 4

page 5

page 7

page 8

page 13

page 14

research
04/05/2023

Segment Anything

We introduce the Segment Anything (SA) project: a new task, model, and d...
research
09/06/2023

SLiMe: Segment Like Me

Significant strides have been made using large vision-language models, l...
research
09/21/2023

Deshadow-Anything: When Segment Anything Model Meets Zero-shot shadow removal

Segment Anything (SAM), an advanced universal image segmentation model t...
research
05/25/2023

DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification

Large pre-trained models have had a significant impact on computer visio...
research
09/22/2021

NudgeSeg: Zero-Shot Object Segmentation by Repeated Physical Interaction

Recent advances in object segmentation have demonstrated that deep neura...
research
06/14/2022

ReCo: Retrieve and Co-segment for Zero-shot Transfer

Semantic segmentation has a broad range of applications, but its real-wo...
research
05/04/2023

Personalize Segment Anything Model with One Shot

Driven by large-data pre-training, Segment Anything Model (SAM) has been...

Please sign up or login with your details

Forgot password? Click here to reset