DFormer: Diffusion-guided Transformer for Universal Image Segmentation

06/06/2023
by Hefeng Wang, et al.

This paper introduces an approach, named DFormer, for universal image segmentation. DFormer treats the universal image segmentation task as a denoising process using a diffusion model. During training, DFormer first adds various levels of Gaussian noise to ground-truth masks, and then learns a model to predict denoised masks from the corrupted masks. Specifically, we take deep pixel-level features along with the noisy masks as inputs to generate mask features and attention masks, employing a diffusion-based decoder to perform mask prediction gradually. At inference, DFormer directly predicts the masks and corresponding categories from a set of randomly generated masks. Extensive experiments reveal the merits of our proposed contributions on three image segmentation tasks: panoptic segmentation, instance segmentation, and semantic segmentation. DFormer outperforms the recent diffusion-based panoptic segmentation method Pix2Seq-D with a gain of 3.6. Further, DFormer achieves promising semantic segmentation performance, outperforming the recent diffusion-based method by 2.2. Source code and models will be made publicly available at https://github.com/cp3wan/DFormer
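The mask-corruption step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the noise schedule or the mask encoding, so the cosine schedule, the mapping of binary masks to {-scale, +scale}, and the names `cosine_alpha_bar` and `corrupt_masks` are all assumptions borrowed from common diffusion-model practice.

```python
import numpy as np

def cosine_alpha_bar(t, s=0.008):
    """Cumulative signal level for t in [0, 1] (cosine schedule; an assumption)."""
    f = np.cos((t + s) / (1 + s) * np.pi / 2) ** 2
    f0 = np.cos(s / (1 + s) * np.pi / 2) ** 2
    return f / f0

def corrupt_masks(gt_masks, t, rng, scale=1.0):
    """Add Gaussian noise to binary ground-truth masks at noise level t.

    Hypothetical helper: the encoding and the `scale` parameter are
    illustrative choices, not taken from the paper.
    """
    # Map {0, 1} masks to {-scale, +scale} so noise is centered on zero.
    x0 = (gt_masks.astype(np.float64) * 2.0 - 1.0) * scale
    a = cosine_alpha_bar(t)
    eps = rng.standard_normal(x0.shape)
    # Standard forward diffusion: blend the clean signal with Gaussian noise.
    return np.sqrt(a) * x0 + np.sqrt(1.0 - a) * eps

rng = np.random.default_rng(0)
gt = np.zeros((2, 8, 8), dtype=np.uint8)  # two toy masks on an 8x8 grid
gt[0, :4, :] = 1                          # mask 0 covers the top half
gt[1, 4:, :] = 1                          # mask 1 covers the bottom half

slightly_noisy = corrupt_masks(gt, t=0.05, rng=rng)  # mostly signal
very_noisy = corrupt_masks(gt, t=0.95, rng=rng)      # mostly noise
```

During training, a model would receive such corrupted masks together with pixel-level image features and learn to recover the clean masks. At inference no ground truth exists, so under the same assumptions the starting point would be pure noise, for example `rng.standard_normal((num_masks, height, width))`, with `num_masks`, `height`, and `width` chosen by the user.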


