An Unsupervised Sampling Approach for Image-Sentence Matching Using Document-Level Structural Information

03/21/2021
by Zejun Li, et al.

In this paper, we focus on the problem of unsupervised image-sentence matching. Existing work uses document-level structural information to sample positive and negative instances for model training. Although this approach yields positive results, it introduces a sampling bias and fails to distinguish instances with high semantic similarity. To alleviate the bias, we propose a new sampling strategy that selects additional intra-document image-sentence pairs as positive or negative samples. Furthermore, to recognize the complex patterns in intra-document samples, we propose a Transformer-based model that captures fine-grained features and implicitly constructs a graph for each document, in which concepts from the document bridge the representation learning of images and sentences within that document's context. Experimental results show that our approach alleviates the bias and learns well-aligned multimodal representations.
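
The two sampling ideas mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify how pairs are chosen, so the sketch assumes one plausible reading in which document-level sampling treats same-document pairs as positives and cross-document pairs as negatives, and intra-document sampling ranks pairs by some similarity score. The function names, the `docs` layout, the `score` callback, and the top-k/bottom-k rule are all hypothetical.

```python
from itertools import product


def cross_document_pairs(docs):
    """Document-level sampling (assumed reading, not the paper's exact rule).

    docs maps a document id to {"images": [...], "sentences": [...]}.
    An image-sentence pair is a positive if both come from the same
    document and a negative otherwise.
    """
    positives, negatives = [], []
    for (id_a, doc_a), (id_b, doc_b) in product(docs.items(), repeat=2):
        for img, sent in product(doc_a["images"], doc_b["sentences"]):
            target = positives if id_a == id_b else negatives
            target.append((img, sent))
    return positives, negatives


def intra_document_pairs(doc, score, k=1):
    """Intra-document sampling (hypothetical selection criterion).

    Ranks every image-sentence pair inside one document by score(img, sent)
    and returns the k highest-scoring pairs as extra positives and the
    k lowest-scoring pairs as extra negatives.
    """
    pairs = sorted(
        product(doc["images"], doc["sentences"]),
        key=lambda pair: score(*pair),
        reverse=True,
    )
    k = min(k, len(pairs) // 2)  # keep the two selections disjoint
    return pairs[:k], pairs[len(pairs) - k:]
```

Under this reading, the first function alone labels every same-document pair as a positive, which is one way a sampling bias could arise; the second function adds negatives drawn from inside a document so that semantically close but mismatched pairs are also seen during training.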
