MoPA: Multi-Modal Prior Aided Domain Adaptation for 3D Semantic Segmentation

09/21/2023
by   Haozhi Cao, et al.
0

Multi-modal unsupervised domain adaptation (MM-UDA) for 3D semantic segmentation is a practical solution to embed semantic understanding in autonomous systems without expensive point-wise annotations. While previous MM-UDA methods can achieve overall improvement, they suffer from significant class-imbalanced performance, restricting their adoption in real applications. This imbalanced performance is mainly caused by: 1) self-training with imbalanced data and 2) the lack of pixel-wise 2D supervision signals. In this work, we propose Multi-modal Prior Aided (MoPA) domain adaptation to improve the performance of rare objects. Specifically, we develop Valid Ground-based Insertion (VGI) to rectify the imbalance supervision signals by inserting prior rare objects collected from the wild while avoiding introducing artificial artifacts that lead to trivial solutions. Meanwhile, our SAM consistency loss leverages the 2D prior semantic masks from SAM as pixel-wise supervision signals to encourage consistent predictions for each object in the semantic mask. The knowledge learned from modal-specific prior is then shared across modalities to achieve better rare object segmentation. Extensive experiments show that our method achieves state-of-the-art performance on the challenging MM-UDA benchmark. Code will be available at https://github.com/AronCao49/MoPA.

READ FULL TEXT

page 1

page 3

page 4

page 6

research
09/26/2020

Affinity Space Adaptation for Semantic Segmentation Across Domains

Semantic segmentation with dense pixel-wise annotation has achieved exce...
research
01/18/2021

Cross-modal Learning for Domain Adaptation in 3D Semantic Segmentation

Domain adaptation is an important task to enable learning when labels ar...
research
04/06/2023

Exploiting the Complementarity of 2D and 3D Networks to Address Domain-Shift in 3D Semantic Segmentation

3D semantic segmentation is a critical task in many real-world applicati...
research
05/02/2022

Cross-Domain Correlation Distillation for Unsupervised Domain Adaptation in Nighttime Semantic Segmentation

The performance of nighttime semantic segmentation is restricted by the ...
research
11/28/2019

xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation

Unsupervised Domain Adaptation (UDA) is crucial to tackle the lack of an...
research
03/18/2023

Multi-Modal Continual Test-Time Adaptation for 3D Semantic Segmentation

Continual Test-Time Adaptation (CTTA) generalizes conventional Test-Time...
research
06/05/2023

MM-DAG: Multi-task DAG Learning for Multi-modal Data – with Application for Traffic Congestion Analysis

This paper proposes to learn Multi-task, Multi-modal Direct Acyclic Grap...

Please sign up or login with your details

Forgot password? Click here to reset