Missing Modality Robustness in Semi-Supervised Multi-Modal Semantic Segmentation

04/21/2023
by   Harsh Maheshwari, et al.
0

Using multiple spatial modalities has been proven helpful in improving semantic segmentation performance. However, there are several real-world challenges that have yet to be addressed: (a) improving label efficiency and (b) enhancing robustness in realistic scenarios where modalities are missing at the test time. To address these challenges, we first propose a simple yet efficient multi-modal fusion mechanism Linear Fusion, that performs better than the state-of-the-art multi-modal models even with limited supervision. Second, we propose M3L: Multi-modal Teacher for Masked Modality Learning, a semi-supervised framework that not only improves the multi-modal performance but also makes the model robust to the realistic missing modality scenario using unlabeled data. We create the first benchmark for semi-supervised multi-modal semantic segmentation and also report the robustness to missing modalities. Our proposal shows an absolute improvement of up to 10 mIoU above the most competitive baselines. Our code is available at https://github.com/harshm121/M3L

READ FULL TEXT

page 5

page 12

page 16

page 17

research
11/25/2022

Towards Good Practices for Missing Modality Robust Action Recognition

Standard multi-modal models assume the use of the same modalities in tra...
research
08/09/2018

Overcoming Missing and Incomplete Modalities with Generative Adversarial Networks for Building Footprint Segmentation

The integration of information acquired with different modalities, spati...
research
05/13/2021

Robust Dynamic Multi-Modal Data Fusion: A Model Uncertainty Perspective

This paper is concerned with multi-modal data fusion (MMDF) under unexpe...
research
03/23/2022

On Adversarial Robustness of Large-scale Audio Visual Learning

As audio-visual systems are being deployed for safety-critical tasks suc...
research
05/23/2018

Semi-supervised classification by reaching consensus among modalities

This paper introduces transductive consensus network (TCNs), as an exten...
research
07/30/2018

Modular Sensor Fusion for Semantic Segmentation

Sensor fusion is a fundamental process in robotic systems as it extends ...
research
09/25/2019

Multi-modal segmentation with missing MR sequences using pre-trained fusion networks

Missing data is a common problem in machine learning and in retrospectiv...

Please sign up or login with your details

Forgot password? Click here to reset