Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings

02/14/2022
by   Malte Ostendorff, et al.
10

Learning scientific document representations can be substantially improved through contrastive learning objectives, where the challenge lies in creating positive and negative training samples that encode the desired similarity semantics. Prior work relies on discrete citation relations to generate contrast samples. However, discrete citations enforce a hard cut-off to similarity. This is counter-intuitive to similarity-based learning, and ignores that scientific papers can be very similar despite lacking a direct citation - a core problem of finding related research. Instead, we use controlled nearest neighbor sampling over citation graph embeddings for contrastive learning. This control allows us to learn continuous similarity, to sample hard-to-learn negatives and positives, and also to avoid collisions between negative and positive samples by controlling the sampling margin between them. The resulting method SciNCL outperforms the state-of-the-art on the SciDocs benchmark. Furthermore, we demonstrate that it can train (or tune) models sample-efficiently, and that it can be combined with recent training-efficient methods. Perhaps surprisingly, even training a general-domain language model this way outperforms baselines pretrained in-domain.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/01/2022

Generating Counterfactual Hard Negative Samples for Graph Contrastive Learning

Graph contrastive learning has emerged as a powerful tool for unsupervis...
research
06/06/2022

Improving Contrastive Learning of Sentence Embeddings with Case-Augmented Positives and Retrieved Negatives

Following SimCSE, contrastive learning based methods have achieved the s...
research
07/05/2023

Graph Contrastive Topic Model

Existing NTMs with contrastive learning suffer from the sample bias prob...
research
12/16/2022

Hard Sample Aware Network for Contrastive Deep Graph Clustering

Contrastive deep graph clustering, which aims to divide nodes into disjo...
research
04/16/2023

H2CGL: Modeling Dynamics of Citation Network for Impact Prediction

The potential impact of a paper is often quantified by how many citation...
research
04/25/2023

CitePrompt: Using Prompts to Identify Citation Intent in Scientific Papers

Citations in scientific papers not only help us trace the intellectual l...
research
08/10/2020

The Role of Positive and Negative Citations in Scientific Evaluation

Quantifying the impact of scientific papers objectively is crucial for r...

Please sign up or login with your details

Forgot password? Click here to reset