PDFNet: Pointwise Dense Flow Network for Urban-Scene Segmentation

In recent years, using a deep convolutional neural network (CNN) as a feature encoder (or backbone) has become the most common architectural pattern in computer vision methods, and semantic segmentation is no exception. This pattern has two major drawbacks: (i) the networks often fail to capture small classes such as wall, fence, pole, traffic light, traffic sign, and bicycle, which are crucial for autonomous vehicles to make accurate decisions; and (ii) because their depth is increased arbitrarily, the networks require massive amounts of labeled data to converge and additional regularization techniques to reduce the risk of over-fitting. While regularization techniques come at minimal cost, collecting labeled data is an expensive and laborious process. In this work, we address these two drawbacks by proposing a novel lightweight architecture named pointwise dense flow network (PDFNet). In PDFNet, we employ dense, residual, and multiple shortcut connections to allow a smooth gradient flow to all parts of the network. Extensive experiments on the Cityscapes and CamVid benchmarks demonstrate that our method significantly outperforms baselines in capturing small classes and in few-data regimes. Moreover, our method achieves considerable performance in classifying samples from outside the training distribution, evaluated by training on Cityscapes and testing on the KITTI dataset.
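To make the combination of pointwise, dense, and residual connections concrete, the following is a minimal NumPy sketch of one such block. It is an illustration under stated assumptions, not the PDFNet architecture itself: the abstract does not specify layer counts, channel widths, or how the shortcuts are wired, so the function names (`pointwise_conv`, `dense_residual_block`), the growth rate, and the fusion step are all hypothetical.

```python
import numpy as np


def pointwise_conv(x, w):
    """1x1 convolution: mixes channels independently at every pixel.

    x has shape (H, W, C_in); w has shape (C_in, C_out).
    """
    return x @ w


def relu(x):
    return np.maximum(x, 0.0)


def dense_residual_block(x, weights, w_out):
    """Hypothetical block (assumption, not the authors' design).

    Dense: each layer receives the concatenation of the block input
    and all earlier layer outputs.
    Residual: the fused result is added back to the block input.
    """
    features = [x]
    for w in weights:
        stacked = np.concatenate(features, axis=-1)
        features.append(relu(pointwise_conv(stacked, w)))
    # Fuse all features back to the input channel count with a 1x1 conv.
    fused = pointwise_conv(np.concatenate(features, axis=-1), w_out)
    return x + fused


# Illustrative sizes only (assumed, not taken from the paper).
rng = np.random.default_rng(0)
c, growth, n_layers = 8, 4, 3
x = rng.standard_normal((16, 16, c))
weights = [
    rng.standard_normal((c + i * growth, growth)) * 0.1
    for i in range(n_layers)
]
w_out = rng.standard_normal((c + n_layers * growth, c)) * 0.1
y = dense_residual_block(x, weights, w_out)
```

Because the input is added back at the end and every layer sees all earlier features, a gradient at the output has short paths to every weight in the block, which is the general intuition behind the "smooth gradient flow" claim.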


