CoCon: A Data Set on Combined Contextualized Research Artifact Use

03/27/2023
by   Tarek Saier, et al.
0

In the wake of information overload in academia, methodologies and systems for search, recommendation, and prediction to aid researchers in identifying relevant research are actively studied and developed. Existing work, however, is limited in terms of granularity, focusing only on the level of papers or a single type of artifact, such as data sets. To enable more holistic analyses and systems dealing with academic publications and their content, we propose CoCon, a large scholarly data set reflecting the combined use of research artifacts, contextualized in academic publications' full-text. Our data set comprises 35 k artifacts (data sets, methods, models, and tasks) and 340 k publications. We additionally formalize a link prediction task for "combined research artifact use prediction" and provide code to utilize analyses of and the development of ML applications on our data. All data and code is publicly available at https://github.com/IllDepence/contextgraph.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/27/2023

unarXive 2022: All arXiv Publications Pre-Processed for NLP, Including Structured Full-Text and Citation Network

Large-scale data sets on scholarly publications are the basis for a vari...
research
03/16/2018

Link prediction for interdisciplinary collaboration via co-authorship network

We analyse the Publication and Research (PURE) data set of University of...
research
03/29/2020

Elastic Coupled Co-clustering for Single-Cell Genomic Data

The recent advances in single-cell technologies have enabled us to profi...
research
02/23/2021

Data Engineering for Everyone

Data engineering is one of the fastest-growing fields within machine lea...
research
06/20/2018

Developing a Temporal Bibliographic Data Set for Entity Resolution

Entity resolution is the process of identifying groups of records within...
research
01/24/2019

Readership Data and Research Impact

Reading academic publications is a key scholarly activity. Scholars acce...

Please sign up or login with your details

Forgot password? Click here to reset