Source-Free Domain Adaptation for RGB-D Semantic Segmentation with Vision Transformers

05/23/2023
by Giulia Rizzoli, et al.

With the increasing availability of depth sensors, multimodal frameworks that combine color information with depth data are attracting growing interest. In the challenging task of semantic segmentation, depth maps make it possible to distinguish between similarly colored objects at different depths and provide useful geometric cues. On the other hand, ground truth data for semantic segmentation is burdensome to produce, which makes domain adaptation another significant research area. Specifically, we address the challenging source-free domain adaptation setting, where the adaptation is performed without reusing source data. We propose MISFIT: MultImodal Source-Free Information fusion Transformer, a depth-aware framework that injects depth information into a segmentation module based on vision transformers at multiple stages, namely at the input, feature, and output levels. Color and depth style transfer helps early-stage domain alignment, while re-wiring self-attention between modalities creates mixed features that allow the extraction of better semantic content. Furthermore, a depth-based entropy minimization strategy is proposed to adaptively weight regions at different distances. Our framework, which is also the first approach using vision transformers for source-free semantic segmentation, shows noticeable performance improvements with respect to standard strategies.
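
To make the idea of depth-based entropy minimization more concrete, the following is a minimal sketch in plain Python. It is not the authors' formulation: the abstract does not specify how depth enters the loss, so the linear weighting `1 + alpha * normalized_depth`, the choice to emphasize distant pixels, and the names `pixel_entropy`, `depth_weighted_entropy`, and `alpha` are all assumptions made for illustration.

```python
import math


def pixel_entropy(probs):
    """Shannon entropy (in nats) of one pixel's class-probability vector."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def depth_weighted_entropy(prob_map, depth_map, alpha=1.0):
    """Weighted mean of per-pixel entropies, with weights that depend on depth.

    prob_map:  H x W nested list of per-class probabilities (softmax outputs).
    depth_map: H x W nested list of non-negative depth values.
    alpha:     hypothetical knob; 0 recovers plain entropy minimization,
               larger values put more weight on distant pixels.
    """
    max_depth = max(max(row) for row in depth_map)
    total, weight_sum = 0.0, 0.0
    for prob_row, depth_row in zip(prob_map, depth_map):
        for probs, depth in zip(prob_row, depth_row):
            # Normalize depth to [0, 1]; guard against an all-zero depth map.
            norm_depth = depth / max_depth if max_depth > 0 else 0.0
            # Assumed weighting, not taken from the paper.
            weight = 1.0 + alpha * norm_depth
            total += weight * pixel_entropy(probs)
            weight_sum += weight
    return total / weight_sum


if __name__ == "__main__":
    # One row, two pixels: an uncertain near pixel and a confident far pixel.
    prob_map = [[[0.5, 0.5], [1.0, 0.0]]]
    depth_map = [[1.0, 2.0]]
    print(depth_weighted_entropy(prob_map, depth_map, alpha=0.0))
    print(depth_weighted_entropy(prob_map, depth_map, alpha=1.0))
```

In a source-free setting, a quantity of this kind would be minimized on unlabeled target images, since no source data or target labels are available. A real implementation would use a tensor library and a differentiable loss rather than nested lists.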


