Improving Semantic Segmentation via Video Propagation and Label Relaxation

12/04/2018
by Yi Zhu, et al.

Semantic segmentation requires large amounts of pixel-wise annotations to learn accurate models. In this paper, we present a video prediction-based methodology that scales up training sets by synthesizing new training samples, in order to improve the accuracy of semantic segmentation networks. We exploit the ability of video prediction models to predict future frames in order to also predict future labels. We also propose a joint propagation strategy to alleviate misalignments in synthesized samples. We demonstrate that training segmentation models on datasets augmented by the synthesized samples leads to significant improvements in accuracy. Furthermore, we introduce a novel boundary label relaxation technique that makes training robust to annotation noise and propagation artifacts along object boundaries. Our proposed methods achieve state-of-the-art mIoUs of 83.5% on Cityscapes and 82.9% on CamVid. Our single model, without model ensembles, achieves 72.8% mIoU on the KITTI semantic segmentation test set, which surpasses the winning entry of the ROB challenge 2018. Our code and videos can be found at https://nv-adlr.github.io/publication/2018-Segmentation.
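The abstract does not spell out how boundary label relaxation is computed, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes that a pixel near an object boundary may be credited for predicting any class that appears in a small window around it, so the loss is the negative log of the total probability assigned to those classes. Away from boundaries the window contains a single class and the loss reduces to ordinary cross-entropy. The function names, the square window, and the `radius` parameter are hypothetical choices made for illustration.

```python
import math


def neighborhood_classes(labels, row, col, radius=1):
    """Return the set of class ids in the square window centred on (row, col).

    The window is clipped at the image border. The square window shape and
    its radius are assumptions for this sketch.
    """
    height, width = len(labels), len(labels[0])
    classes = set()
    for r in range(max(0, row - radius), min(height, row + radius + 1)):
        for c in range(max(0, col - radius), min(width, col + radius + 1)):
            classes.add(labels[r][c])
    return classes


def relaxed_pixel_loss(probs, allowed):
    """Negative log of the probability mass assigned to the allowed classes."""
    mass = sum(probs[k] for k in allowed)
    # Clamp to avoid log(0) when the model assigns no mass to allowed classes.
    return -math.log(max(mass, 1e-12))


def relaxed_loss(labels, probs, radius=1):
    """Mean relaxed loss over all pixels.

    labels[r][c] is an integer class id; probs[r][c] is a list of per-class
    probabilities (already normalised, e.g. by a softmax).
    """
    total, count = 0.0, 0
    for r in range(len(labels)):
        for c in range(len(labels[0])):
            allowed = neighborhood_classes(labels, r, c, radius)
            total += relaxed_pixel_loss(probs[r][c], allowed)
            count += 1
    return total / count
```

In this sketch, a pixel whose window spans two classes incurs no penalty for splitting its probability between them, which is one way such a loss could tolerate noisy annotations or slightly misaligned propagated labels along edges.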

research
10/22/2019

Towards Automatic Annotation for Semantic Segmentation in Drone Videos

Semantic segmentation is a crucial task for robot navigation and safety....
research
06/11/2023

3rd Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation

In order to deal with the task of video panoptic segmentation in the wil...
research
06/19/2020

Video Panoptic Segmentation

Panoptic segmentation has become a new standard of visual recognition ta...
research
11/22/2021

Improving Semantic Image Segmentation via Label Fusion in Semantically Textured Meshes

Models for semantic segmentation require a large amount of hand-labeled ...
research
07/07/2021

WeClick: Weakly-Supervised Video Semantic Segmentation with Click Annotations

Compared with tedious per-pixel mask annotating, it is much easier to an...
research
04/06/2023

A Closer Look at Audio-Visual Semantic Segmentation

Audio-visual segmentation (AVS) is a complex task that involves accurate...
research
04/16/2019

Devil is in the Edges: Learning Semantic Boundaries from Noisy Annotations

We tackle the problem of semantic boundary prediction, which aims to ide...
