
Edge-Preserving Guided Semantic Segmentation for VIPriors Challenge

by Chih-Chung Hsu, et al.

Semantic segmentation is one of the most active research fields in computer vision. In the VIPriors challenge, only a very limited number of training samples is allowed, which makes current state-of-the-art deep-learning-based semantic segmentation techniques hard to train well. To overcome this limitation, we propose edge-preserving guidance that provides extra prior information and helps avoid overfitting on a small-scale training dataset. First, a two-channel convolutional layer is attached to the last layer of a conventional semantic segmentation network. Then, an edge map is computed from the ground truth with the Sobel operator, followed by a hard-thresholding operation that indicates whether each pixel is an edge or not. Finally, a two-dimensional cross-entropy loss, termed the edge-preserving loss, is computed between the predicted edge map and its ground truth. In this way, the proposed edge-preserving loss enforces the continuity of boundaries between different instances. Experiments demonstrate that the proposed method achieves excellent performance on a small-scale training set compared to state-of-the-art semantic segmentation techniques.
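The edge-guidance steps described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the edge-replicated padding, the threshold value of 0, and the choice to apply the Sobel filter directly to integer class IDs are all assumptions made for illustration. The two-channel logits stand in for the output of the extra two-channel convolutional layer, and how this loss is weighted against the segmentation loss is not stated in the abstract, so it is left out.

```python
import numpy as np

# Standard 3x3 Sobel kernels for horizontal and vertical gradients.
SOBEL_X = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=np.float64)
SOBEL_Y = SOBEL_X.T


def filter3x3(image, kernel):
    """Cross-correlate a 2-D image with a 3x3 kernel (edge-replicated padding)."""
    padded = np.pad(image, 1, mode="edge")
    h, w = image.shape
    out = np.zeros((h, w), dtype=np.float64)
    for dy in range(3):
        for dx in range(3):
            out += kernel[dy, dx] * padded[dy:dy + h, dx:dx + w]
    return out


def edge_map_from_labels(labels, threshold=0.0):
    """Binary edge map from a ground-truth label map (H, W).

    Assumption: Sobel is applied to the raw class IDs and any gradient
    magnitude above `threshold` is marked as an edge (hard thresholding).
    """
    labels = labels.astype(np.float64)
    gx = filter3x3(labels, SOBEL_X)
    gy = filter3x3(labels, SOBEL_Y)
    magnitude = np.hypot(gx, gy)
    return (magnitude > threshold).astype(np.int64)


def edge_preserving_loss(edge_logits, edge_target):
    """Mean pixel-wise cross-entropy between two-channel logits and a binary edge map.

    edge_logits: array of shape (2, H, W), channel 0 = non-edge, channel 1 = edge.
    edge_target: integer array of shape (H, W) with values in {0, 1}.
    """
    # Numerically stable log-softmax over the channel axis.
    shifted = edge_logits - edge_logits.max(axis=0, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=0, keepdims=True))
    rows, cols = np.indices(edge_target.shape)
    picked = log_probs[edge_target, rows, cols]  # log-probability of the true class
    return float(-picked.mean())
```

In a training loop, `edge_map_from_labels` would be applied to each ground-truth mask and the resulting map passed to `edge_preserving_loss` together with the two-channel prediction; a deep learning framework's built-in cross-entropy would normally replace the NumPy version so that gradients can flow.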




SEMEDA: Enhancing Segmentation Precision with Semantic Edge Aware Loss

While nowadays deep neural networks achieve impressive performances on s...

Scaling Semantic Segmentation Beyond 1K Classes on a Single GPU

The state-of-the-art object detection and image classification methods c...

JSENet: Joint Semantic Segmentation and Edge Detection Network for 3D Point Clouds

Semantic segmentation and semantic edge detection can be seen as two dua...

Distance Map Loss Penalty Term for Semantic Segmentation

Convolutional neural networks for semantic segmentation suffer from low ...

Class Based Thresholding in Early Exit Semantic Segmentation Networks

We propose Class Based Thresholding (CBT) to reduce the computational co...

CAFENet: Class-Agnostic Few-Shot Edge Detection Network

We tackle a novel few-shot learning challenge, which we call few-shot se...

Multi-domain semantic segmentation with pyramidal fusion

We present our submission to the semantic segmentation contest of the Ro...