Enhancing Multimodal Entity and Relation Extraction with Variational Information Bottleneck

04/05/2023
by Shiyao Cui, et al.

This paper studies multimodal named entity recognition (MNER) and multimodal relation extraction (MRE), which are important for analyzing multimedia social platforms. The core of MNER and MRE lies in incorporating evident visual information to enhance textual semantics, which raises two issues that demand investigation. The first issue is modality noise: task-irrelevant information in each modality may act as noise that misleads the task prediction. The second issue is the modality gap: representations from different modalities are inconsistent, which prevents building a semantic alignment between text and image. To address these issues, we propose a novel method for MNER and MRE, Multi-Modal representation learning with Information Bottleneck (MMIB). For the first issue, a refinement-regularizer applies the information-bottleneck principle to balance predictive evidence against noisy information, yielding expressive representations for prediction. For the second issue, we propose an alignment-regularizer, in which a mutual-information-based term works in a contrastive manner to encourage consistent text-image representations. To the best of our knowledge, we are the first to explore variational IB estimation for MNER and MRE. Experiments show that MMIB achieves state-of-the-art performance on three public benchmarks.
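
The two regularizers described in the abstract can be illustrated with a minimal sketch. This is not the MMIB implementation: it assumes the common variational IB formulation, in which the compression term is a KL divergence between a diagonal-Gaussian encoder and a standard-normal prior, and it assumes an InfoNCE-style contrastive loss as the mutual-information-based alignment term. All function names and the weights `beta` and `gamma` are hypothetical and chosen only for illustration.

```python
import math

def kl_to_standard_normal(mu, log_var):
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ) for a single latent vector.

    In variational IB this upper-bounds the information the latent keeps
    about its input, so minimizing it discourages retaining noise.
    """
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv
                     for m, lv in zip(mu, log_var))

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def info_nce(text_vecs, image_vecs, temperature=0.1):
    """Mean InfoNCE loss; text_vecs[i] and image_vecs[i] form a matched pair.

    Every other image in the batch serves as a negative for text i.
    """
    total = 0.0
    for i, t in enumerate(text_vecs):
        logits = [dot(t, v) / temperature for v in image_vecs]
        top = max(logits)  # subtract the max for a stable log-sum-exp
        log_norm = top + math.log(sum(math.exp(l - top) for l in logits))
        total += log_norm - logits[i]
    return total / len(text_vecs)

def combined_loss(task_loss, mu, log_var, text_vecs, image_vecs,
                  beta=0.01, gamma=0.1):
    """Hypothetical weighting: task loss + refinement term + alignment term."""
    return (task_loss
            + beta * kl_to_standard_normal(mu, log_var)
            + gamma * info_nce(text_vecs, image_vecs))
```

In this sketch a latent that already matches the prior incurs no compression penalty, and the contrastive term is smaller when each text vector is most similar to its own image vector than when the pairs are mismatched.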

research
11/28/2022

Joint Multimodal Entity-Relation Extraction Based on Edge-enhanced Graph Alignment Network and Word-pair Relation Tagging

Multimodal named entity recognition (MNER) and multimodal relation extra...
research
05/15/2023

A Novel Framework for Multimodal Named Entity Recognition with Multi-level Alignments

Mining structured knowledge from tweets using named entity recognition (...
research
05/07/2022

Good Visual Guidance Makes A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extraction

Multimodal named entity recognition and relation extraction (MNER and MR...
research
05/19/2023

Information Screening whilst Exploiting! Multimodal Relation Extraction with Feature Denoising and Multimodal Topic Modeling

Existing research on multimodal relation extraction (MRE) faces two co-e...
research
12/03/2022

Named Entity and Relation Extraction with Multi-Modal Retrieval

Multi-modal named entity recognition (NER) and relation extraction (RE) ...
research
06/25/2023

Chain-of-Thought Prompt Distillation for Multimodal Named Entity and Multimodal Relation Extraction

Multimodal Named Entity Recognition (MNER) and Multimodal Relation Extra...
research
11/11/2022

Unimodal and Multimodal Representation Training for Relation Extraction

Multimodal integration of text, layout and visual information has achiev...
