Multimodal Relation Extraction with Cross-Modal Retrieval and Synthesis

05/25/2023
by   Xuming Hu, et al.
0

Multimodal relation extraction (MRE) is the task of identifying the semantic relationships between two entities based on the context of the sentence image pair. Existing retrieval-augmented approaches mainly focused on modeling the retrieved textual knowledge, but this may not be able to accurately identify complex relations. To improve the prediction, this research proposes to retrieve textual and visual evidence based on the object, sentence, and whole image. We further develop a novel approach to synthesize the object-level, image-level, and sentence-level information for better reasoning between the same and different modalities. Extensive experiments and analyses show that the proposed method is able to effectively select and compare evidence across modalities and significantly outperforms state-of-the-art models.

READ FULL TEXT

page 1

page 2

research
12/09/2019

Effective Attention Modeling for Neural Relation Extraction

Relation extraction is the task of determining the relation between two ...
research
10/26/2022

ReSel: N-ary Relation Extraction from Scientific Text and Tables by Learning to Retrieve and Select

We study the problem of extracting N-ary relation tuples from scientific...
research
05/19/2023

Information Screening whilst Exploiting! Multimodal Relation Extraction with Feature Denoising and Multimodal Topic Modeling

Existing research on multimodal relation extraction (MRE) faces two co-e...
research
10/17/2022

Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval

Image-sentence retrieval has attracted extensive research attention in m...
research
11/28/2022

Joint Multimodal Entity-Relation Extraction Based on Edge-enhanced Graph Alignment Network and Word-pair Relation Tagging

Multimodal named entity recognition (MNER) and multimodal relation extra...
research
12/21/2022

Multi-hop Evidence Retrieval for Cross-document Relation Extraction

Relation Extraction (RE) has been extended to cross-document scenarios b...
research
07/01/2021

Multimodal Graph-based Transformer Framework for Biomedical Relation Extraction

The recent advancement of pre-trained Transformer models has propelled t...

Please sign up or login with your details

Forgot password? Click here to reset