DejaVu: Conditional Regenerative Learning to Enhance Dense Prediction

03/02/2023
by   Shubhankar Borse, et al.
0

We present DejaVu, a novel framework which leverages conditional image regeneration as additional supervision during training to improve deep networks for dense prediction tasks such as segmentation, depth estimation, and surface normal prediction. First, we apply redaction to the input image, which removes certain structural information by sparse sampling or selective frequency removal. Next, we use a conditional regenerator, which takes the redacted image and the dense predictions as inputs, and reconstructs the original image by filling in the missing structural information. In the redacted image, structural attributes like boundaries are broken while semantic context is largely preserved. In order to make the regeneration feasible, the conditional generator will then require the structure information from the other input source, i.e., the dense predictions. As such, by including this conditional regeneration objective during training, DejaVu encourages the base network to learn to embed accurate scene structure in its dense prediction. This leads to more accurate predictions with clearer boundaries and better spatial consistency. When it is feasible to leverage additional computation, DejaVu can be extended to incorporate an attention-based regeneration module within the dense prediction network, which further improves accuracy. Through extensive experiments on multiple dense prediction benchmarks such as Cityscapes, COCO, ADE20K, NYUD-v2, and KITTI, we demonstrate the efficacy of employing DejaVu during training, as it outperforms SOTA methods at no added computation cost.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 6

page 7

research
12/02/2018

DeepLiDAR: Deep Surface Normal Guided Depth Prediction for Outdoor Scene from Sparse LiDAR Data and Single Color Image

In this paper, we propose a deep learning architecture that produces acc...
research
03/25/2018

Deep Depth Completion of a Single RGB-D Image

The goal of our work is to complete the depth channel of an RGB-D image....
research
09/03/2018

Detail Preserving Depth Estimation from a Single Image Using Attention Guided Networks

Convolutional Neural Networks have demonstrated superior performance on ...
research
03/30/2023

DDP: Diffusion Model for Dense Visual Prediction

We propose a simple, efficient, yet powerful framework for dense visual ...
research
02/27/2017

Fast and Accurate Inference with Adaptive Ensemble Prediction in Image Classification with Deep Neural Networks

Ensembling multiple predictions is a widely used technique to improve th...
research
03/03/2022

Fast Neural Architecture Search for Lightweight Dense Prediction Networks

We present LDP, a lightweight dense prediction neural architecture searc...
research
09/21/2019

Efficient Surface-Aware Semi-Global Matching with Multi-View Plane-Sweep Sampling

Online augmentation of an oblique aerial image sequence with structural ...

Please sign up or login with your details

Forgot password? Click here to reset