PubGraph: A Large Scale Scientific Temporal Knowledge Graph

02/04/2023
by   Kian Ahrabian, et al.
0

Research publications are the primary vehicle for sharing scientific progress in the form of new discoveries, methods, techniques, and insights. Publications have been studied from the perspectives of both content analysis and bibliometric structure, but a barrier to more comprehensive studies of scientific research is a lack of publicly accessible large-scale data and resources. In this paper, we present PubGraph, a new resource for studying scientific progress that takes the form of a large-scale temporal knowledge graph (KG). It contains more than 432M nodes and 15.49B edges mapped to the popular Wikidata ontology. We extract three KGs with varying sizes from PubGraph to allow experimentation at different scales. Using these KGs, we introduce a new link prediction benchmark for transductive and inductive settings with temporally-aligned training, validation, and testing partitions. Moreover, we develop two new inductive learning methods better suited to PubGraph, operating on unseen nodes without explicit features, scaling to large KGs, and outperforming existing models. Our results demonstrate that structural features of past citations are sufficient to produce high-quality predictions about new publications. We also identify new challenges for KG models, including an adversarial community-based link prediction setting, zero-shot inductive learning, and large-scale learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/11/2020

Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Prediction

Many practical graph problems, such as knowledge graph construction and ...
research
11/23/2021

Triple Classification for Scholarly Knowledge Graph Completion

Scholarly Knowledge Graphs (KGs) provide a rich source of structured inf...
research
11/22/2022

BESS: Balanced Entity Sampling and Sharing for Large-Scale Knowledge Graph Completion

We present the award-winning submission to the WikiKG90Mv2 track of OGB-...
research
08/16/2017

Hypotheses generation using link prediction in a bipartite graph

The large volume of scientific publications is likely to have hidden kno...
research
04/02/2023

Improving Few-Shot Inductive Learning on Temporal Knowledge Graphs using Confidence-Augmented Reinforcement Learning

Temporal knowledge graph completion (TKGC) aims to predict the missing l...
research
01/02/2023

IRT2: Inductive Linking and Ranking in Knowledge Graphs of Varying Scale

We address the challenge of building domain-specific knowledge models fo...
research
06/13/2017

A Supervised Approach to Extractive Summarisation of Scientific Papers

Automatic summarisation is a popular approach to reduce a document to it...

Please sign up or login with your details

Forgot password? Click here to reset