SemOpenAlex: The Scientific Landscape in 26 Billion RDF Triples

08/07/2023
by   Michael Färber, et al.
0

We present SemOpenAlex, an extensive RDF knowledge graph that contains over 26 billion triples about scientific publications and their associated entities, such as authors, institutions, journals, and concepts. SemOpenAlex is licensed under CC0, providing free and open access to the data. We offer the data through multiple channels, including RDF dump files, a SPARQL endpoint, and as a data source in the Linked Open Data cloud, complete with resolvable URIs and links to other data sources. Moreover, we provide embeddings for knowledge graph entities using high-performance computing. SemOpenAlex enables a broad range of use-case scenarios, such as exploratory semantic search via our website, large-scale scientific impact quantification, and other forms of scholarly big data analytics within and across scientific disciplines. Additionally, it enables academic recommender systems, such as recommending collaborators, publications, and venues, including explainability capabilities. Finally, SemOpenAlex can serve for RDF query optimization benchmarks, creating scholarly knowledge-guided language models, and as a hub for semantic scientific publishing.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/17/2022

EMAKG: An Enhanced Version Of The Microsoft Academic Knowledge Graph

Scholarly knowledge graphs are valuable sources of information in severa...
research
06/27/2023

Challenges and Opportunities for RISC-V Architectures towards Genomics-based Workloads

The use of large-scale supercomputing architectures is a hard requiremen...
research
01/24/2023

The Semantic Scholar Open Data Platform

The volume of scientific output is creating an urgent need for automated...
research
05/04/2022

OpenAlex: A fully-open index of scholarly works, authors, venues, institutions, and concepts

OpenAlex is a new, fully-open scientific knowledge graph (SKG), launched...
research
11/27/2020

CybergeoNetworks, an interactive application for the geographical and semantic analysis of scientific publications

The increase in the number of publications has made more difficult for a...
research
04/22/2022

S2AMP: A High-Coverage Dataset of Scholarly Mentorship Inferred from Publications

Mentorship is a critical component of academia, but is not as visible as...
research
12/10/2016

Data Curation APIs

Understanding and analyzing big data is firmly recognized as a powerful ...

Please sign up or login with your details

Forgot password? Click here to reset