DeepAI AI Chat
Log In Sign Up

Elsevier OA CC-By Corpus

08/03/2020
by   Daniel Kershaw, et al.
Elsevier
0

We introduce the Elsevier OA CC-BY corpus. This is the first open corpus of Scientific Research papers which has a representative sample from across scientific disciplines. This corpus not only includes the full text of the article, but also the metadata of the documents, along with the bibliographic information for each reference.

READ FULL TEXT
11/07/2019

S2ORC: The Semantic Scholar Open Research Corpus

We introduce S2ORC, a large contextual citation graph of English-languag...
11/04/2022

SMAuC – The Scientific Multi-Authorship Corpus

With an ever-growing number of new publications each day, scientific wri...
10/28/2021

Using Text Analytics for Health to Get Meaningful Insights from a Corpus of COVID Scientific Papers

Since the beginning of COVID pandemic, there have been around 700000 sci...
11/16/2022

Galactica: A Large Language Model for Science

Information overload is a major obstacle to scientific progress. The exp...
01/03/2018

The Temple University Hospital Seizure Detection Corpus

We introduce the TUH EEG Seizure Corpus (TUSZ), which is the largest ope...
04/21/2021

Possibilities, Challenges and Limits of a European Charters Corpus (Cartae Europae Medii Aevi - CEMA)

The objective of this paper is to present a meta-corpus of diplomatic do...
11/10/2021

Multimodal Approach for Metadata Extraction from German Scientific Publications

Nowadays, metadata information is often given by the authors themselves ...