Domain Adaptive Video Segmentation via Temporal Consistency Regularization

07/23/2021
by Dayan Guan, et al.

Video semantic segmentation is an essential task for the analysis and understanding of videos. Recent efforts largely focus on supervised video segmentation by learning from fully annotated data, but the learnt models often suffer a clear performance drop when applied to videos from a different domain. This paper presents DA-VSN, a domain adaptive video segmentation network that addresses domain gaps in videos through temporal consistency regularization (TCR) on consecutive frames of target-domain videos. DA-VSN consists of two novel and complementary designs. The first is cross-domain TCR, which uses adversarial learning to guide predictions on target frames toward the same temporal consistency as predictions on source frames (learnt from annotated source data). The second is intra-domain TCR, which guides unconfident predictions on target frames toward the same temporal consistency as confident predictions on target frames. Extensive experiments show that DA-VSN consistently outperforms multiple baselines by large margins.
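To make the intra-domain idea more concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not say how confidence is measured or how frames are compared, so this sketch assumes per-pixel entropy as the confidence measure, an L1 gap as the consistency penalty, and two consecutive frames that are already spatially aligned. The function names `entropy` and `intra_domain_tcr_loss` are hypothetical.

```python
import math


def entropy(probs):
    """Shannon entropy of one pixel's class-probability vector (assumed confidence measure)."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def intra_domain_tcr_loss(prev_frame, curr_frame):
    """Mean L1 gap, counted only on pixels where the current frame is less confident.

    prev_frame, curr_frame: lists of per-pixel probability vectors for two
    consecutive target frames. Assumption: the frames are already aligned.
    """
    total = 0.0
    for prev_px, curr_px in zip(prev_frame, curr_frame):
        # Higher entropy means lower confidence; only those pixels are penalized.
        if entropy(curr_px) > entropy(prev_px):
            total += sum(abs(c - p) for c, p in zip(curr_px, prev_px))
    return total / len(curr_frame)


# Toy example: two pixels, two classes.
prev_frame = [[0.9, 0.1], [0.5, 0.5]]
curr_frame = [[0.6, 0.4], [0.8, 0.2]]
loss = intra_domain_tcr_loss(prev_frame, curr_frame)
print(loss)
```

In this toy example only the first pixel contributes, because that is the only place where the current frame is less confident than the previous one. In an actual training loop the confident prediction would presumably be treated as a fixed target, so that gradients only flow through the unconfident prediction; that detail is also an assumption here.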

research
07/06/2022

Domain Adaptive Video Segmentation via Temporal Pseudo Supervision

Video semantic segmentation has achieved great progress under the superv...
research
11/04/2022

Domain Adaptive Video Semantic Segmentation via Cross-Domain Moving Object Mixing

The network trained for domain adaptation is prone to bias toward the ea...
research
08/13/2022

Memory Efficient Temporal Visual Graph Model for Unsupervised Video Domain Adaptation

Existing video domain adaption (DA) methods need to store all temporal c...
research
06/19/2023

A spatio-temporal network for video semantic segmentation in surgical videos

Semantic segmentation in surgical videos has applications in intra-opera...
research
12/11/2020

Intrinsic Temporal Regularization for High-resolution Human Video Synthesis

Temporal consistency is crucial for extending image processing pipelines...
research
09/03/2016

Towards Segmenting Consumer Stereo Videos: Benchmark, Baselines and Ensembles

Are we ready to segment consumer stereo videos? The amount of this data ...
research
07/14/2022

Tackling Background Distraction in Video Object Segmentation

Semi-supervised video object segmentation (VOS) aims to densely track ce...
