Calibrating Multimodal Learning

06/02/2023
by   Huan Ma. Qingyang Zhang, et al.
0

Multimodal machine learning has achieved remarkable progress in a wide range of scenarios. However, the reliability of multimodal learning remains largely unexplored. In this paper, through extensive empirical studies, we identify current multimodal classification methods suffer from unreliable predictive confidence that tend to rely on partial modalities when estimating confidence. Specifically, we find that the confidence estimated by current models could even increase when some modalities are corrupted. To address the issue, we introduce an intuitive principle for multimodal learning, i.e., the confidence should not increase when one modality is removed. Accordingly, we propose a novel regularization technique, i.e., Calibrating Multimodal Learning (CML) regularization, to calibrate the predictive confidence of previous methods. This technique could be flexibly equipped by existing models and improve the performance in terms of confidence calibration, classification accuracy, and model robustness.

READ FULL TEXT

page 2

page 16

page 20

research
08/21/2018

LRMM: Learning to Recommend with Missing Modalities

Multimodal learning has shown promising performance in content-based rec...
research
11/11/2021

Trustworthy Multimodal Regression with Mixture of Normal-inverse Gamma Distributions

Multimodal regression is a fundamental task, which integrates the inform...
research
12/29/2022

Learning Multimodal Data Augmentation in Feature Space

The ability to jointly learn from multiple modalities, such as text, aud...
research
04/10/2023

On Robustness in Multimodal Learning

Multimodal learning is defined as learning over multiple heterogeneous i...
research
01/02/2018

Learning Multimodal Word Representation via Dynamic Fusion Methods

Multimodal models have been proven to outperform text-based models on le...
research
05/12/2021

Cross-Modal and Multimodal Data Analysis Based on Functional Mapping of Spectral Descriptors and Manifold Regularization

Multimodal manifold modeling methods extend the spectral geometry-aware ...
research
03/12/2021

Orthogonal Statistical Inference for Multimodal Data Analysis

Multimodal imaging has transformed neuroscience research. While it prese...

Please sign up or login with your details

Forgot password? Click here to reset