Probabilistic Pixel-Adaptive Refinement Networks

03/31/2020
by   Anne S. Wannenwetsch, et al.
0

Encoder-decoder networks have found widespread use in various dense prediction tasks. However, the strong reduction of spatial resolution in the encoder leads to a loss of location information as well as boundary artifacts. To address this, image-adaptive post-processing methods have shown beneficial by leveraging the high-resolution input image(s) as guidance data. We extend such approaches by considering an important orthogonal source of information: the network's confidence in its own predictions. We introduce probabilistic pixel-adaptive convolutions (PPACs), which not only depend on image guidance data for filtering, but also respect the reliability of per-pixel predictions. As such, PPACs allow for image-adaptive smoothing and simultaneously propagating pixels of high confidence into less reliable regions, while respecting object boundaries. We demonstrate their utility in refinement networks for optical flow and semantic segmentation, where PPACs lead to a clear reduction in boundary artifacts. Moreover, our proposed refinement step is able to substantially improve the accuracy on various widely used benchmarks.

READ FULL TEXT

page 1

page 8

page 13

research
05/12/2020

Probabilistic Semantic Segmentation Refinement by Monte Carlo Region Growing

Semantic segmentation with fine-grained pixel-level accuracy is a fundam...
research
09/09/2019

Learning Task-Specific Generalized Convolutions in the Permutohedral Lattice

Dense prediction tasks typically employ encoder-decoder architectures, b...
research
08/06/2019

AttentionBoost: Learning What to Attend by Boosting Fully Convolutional Networks

Dense prediction models are widely used for image segmentation. One impo...
research
07/18/2017

The Devil is in the Decoder

Many machine vision applications require predictions for every pixel of ...
research
04/13/2018

Deep Motion Boundary Detection

Motion boundary detection is a crucial yet challenging problem. Prior me...
research
07/27/2019

Quadtree Generating Networks: Efficient Hierarchical Scene Parsing with Sparse Convolutions

Semantic segmentation with Convolutional Neural Networks is a memory-int...
research
05/23/2022

ConvPoseCNN2: Prediction and Refinement of Dense 6D Object Poses

Object pose estimation is a key perceptual capability in robotics. We pr...

Please sign up or login with your details

Forgot password? Click here to reset