GORC: A large contextual citation graph of academic papers

11/07/2019
by Kyle Lo, et al.

We introduce the Semantic Scholar Graph of References in Context (GORC), a large contextual citation graph of 81.1M academic publications, including parsed full text for 8.1M open access papers, across broad domains of science. Each paper is represented with rich paper metadata (title, authors, abstract, etc.), and where available: cleaned full text, section headers, figure and table captions, and parsed bibliography entries. In-line citation mentions in full text are linked to their corresponding bibliography entries, which are in turn linked to in-corpus cited papers, forming the edges of a contextual citation graph. To our knowledge, this is the largest publicly available contextual citation graph; the full text alone is the largest parsed academic text corpus publicly available. We demonstrate the ability to identify similar papers using these citation contexts and propose several applications for language modeling and citation-related tasks.
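The linking chain described in the abstract (in-line mention, then bibliography entry, then in-corpus paper) can be pictured with a small sketch. Everything below is an illustrative assumption rather than the released GORC schema: the class names, the field names, the `contextual_edges` helper, and the fixed character window used as the citation context are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class BibEntry:
    """A parsed bibliography entry (hypothetical fields)."""
    bib_id: str
    raw_text: str
    # ID of the cited paper if it was resolved to a paper in the corpus,
    # otherwise None.
    linked_paper_id: Optional[str] = None


@dataclass
class CitationMention:
    """An in-line citation mention, stored as a character span in the full text."""
    start: int
    end: int
    bib_id: str


@dataclass
class Paper:
    paper_id: str
    title: str
    full_text: str = ""
    bib_entries: Dict[str, BibEntry] = field(default_factory=dict)
    mentions: List[CitationMention] = field(default_factory=list)


def contextual_edges(paper: Paper, window: int = 40) -> List[Tuple[str, str, str]]:
    """Return (citing_id, cited_id, context) for each mention that resolves
    to an in-corpus paper. The context is a plain character window around
    the mention, which is an assumption made for this sketch."""
    edges = []
    for mention in paper.mentions:
        entry = paper.bib_entries.get(mention.bib_id)
        if entry is None or entry.linked_paper_id is None:
            continue  # mention points at an unparsed or out-of-corpus reference
        lo = max(0, mention.start - window)
        hi = min(len(paper.full_text), mention.end + window)
        edges.append((paper.paper_id, entry.linked_paper_id, paper.full_text[lo:hi]))
    return edges


# Toy example: two mentions, only one of which resolves to an in-corpus paper.
text = "Prior work [1] built a citation graph, while [2] studied preprints."
paper = Paper(
    paper_id="P1",
    title="Toy citing paper",
    full_text=text,
    bib_entries={
        "b1": BibEntry("b1", "A. Author. A citation graph.", linked_paper_id="P2"),
        "b2": BibEntry("b2", "B. Author. On preprints."),  # not in corpus
    },
    mentions=[
        CitationMention(text.index("[1]"), text.index("[1]") + 3, "b1"),
        CitationMention(text.index("[2]"), text.index("[2]") + 3, "b2"),
    ],
)
edges = contextual_edges(paper)
```

In this toy setup only the mention of `[1]` yields an edge, because its bibliography entry is linked to an in-corpus paper. The mention of `[2]` is skipped, which mirrors the abstract's point that edges exist only where a bibliography entry resolves to a paper in the corpus.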


research
11/07/2019

S2ORC: The Semantic Scholar Open Research Corpus

We introduce S2ORC, a large contextual citation graph of English-languag...
research
03/27/2019

Highly cited references in PLOS ONE and their in-text usage over time

In this article, we describe highly cited publications in a PLOS ONE ful...
research
03/19/2019

ReviewerNet: Visualizing Citation and Authorship Relations for Finding Reviewers

We propose ReviewerNet, an online, interactive visualization system aime...
research
04/06/2021

Structured Citation Trend Prediction Using Graph Neural Networks

Academic citation graphs represent citation relationships between public...
research
04/30/2019

On the Use of ArXiv as a Dataset

The arXiv has collected 1.5 million pre-print articles over 28 years, ho...
research
01/24/2023

The Semantic Scholar Open Data Platform

The volume of scientific output is creating an urgent need for automated...
research
11/17/2022

One Venue, Two Conferences: The Separation of Chinese and American Citation Networks

At NeurIPS, American and Chinese institutions cite papers from each othe...
