DDP: Diffusion Model for Dense Visual Prediction

03/30/2023
by   Yuanfeng Ji, et al.
0

We propose a simple, efficient, yet powerful framework for dense visual predictions based on the conditional diffusion pipeline. Our approach follows a "noise-to-map" generative paradigm for prediction by progressively removing noise from a random Gaussian distribution, guided by the image. The method, called DDP, efficiently extends the denoising diffusion process into the modern perception pipeline. Without task-specific design and architecture customization, DDP is easy to generalize to most dense prediction tasks, e.g., semantic segmentation and depth estimation. In addition, DDP shows attractive properties such as dynamic inference and uncertainty awareness, in contrast to previous single-step discriminative methods. We show top results on three representative tasks with six diverse benchmarks, without tricks, DDP achieves state-of-the-art or competitive performance on each task compared to the specialist counterparts. For example, semantic segmentation (83.9 mIoU on Cityscapes), BEV map segmentation (70.6 mIoU on nuScenes), and depth estimation (0.05 REL on KITTI). We hope that our approach will serve as a solid baseline and facilitate future research

READ FULL TEXT

page 1

page 2

page 5

page 13

page 15

page 16

page 17

research
03/09/2023

DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation

Monocular depth estimation is a challenging task that predicts the pixel...
research
02/18/2022

Joint Learning of Frequency and Spatial Domains for Dense Predictions

Current artificial neural networks mainly conduct the learning process i...
research
10/24/2021

X-Distill: Improving Self-Supervised Monocular Depth via Cross-Task Distillation

In this paper, we propose a novel method, X-Distill, to improve the self...
research
12/06/2022

DiffusionInst: Diffusion Model for Instance Segmentation

Recently, diffusion frameworks have achieved comparable performance with...
research
03/02/2023

DejaVu: Conditional Regenerative Learning to Enhance Dense Prediction

We present DejaVu, a novel framework which leverages conditional image r...
research
10/06/2021

See Yourself in Others: Attending Multiple Tasks for Own Failure Detection

Autonomous robots deal with unexpected scenarios in real environments. G...
research
12/01/2022

Multi-Class Segmentation from Aerial Views using Recursive Noise Diffusion

Semantic segmentation from aerial views is a vital task for autonomous d...

Please sign up or login with your details

Forgot password? Click here to reset