PMC text mining subset in BioC: 2.3 million full text articles and growing

04/16/2018
by Donald C. Comeau, et al.

Interest in full-text mining of biomedical research articles is growing. NCBI provides the PMC Open Access and Author Manuscript sets of articles, which are available for text mining. We have made all of these articles available in BioC, a format with XML and JSON serializations that is convenient for sharing text, annotations, and relations. The articles are available both via FTP for bulk download and via a Web API for updates or more focused collection. Availability: https://www.ncbi.nlm.nih.gov/research/bionlp/APIs/BioC-PMC/
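
The Web API route described above might be used along the following lines. This is a minimal sketch, not the authors' client: the endpoint pattern in `BASE_URL`, the `BioC_json`/`BioC_xml` path segments, the `unicode`/`ascii` encoding values, and the helper names `build_bioc_url`, `fetch_bioc_json`, and `passage_texts` are all assumptions for illustration and should be checked against the Availability page. The sample collection is hand-written to mimic the usual BioC JSON layout (a collection holding documents, each holding passages); it is not real PMC data.

```python
import json
from urllib.request import urlopen

# Assumed endpoint pattern (not stated in the abstract); verify it on the
# Availability page before relying on it.
BASE_URL = "https://www.ncbi.nlm.nih.gov/research/bionlp/RESTful/pmcoa.cgi"


def build_bioc_url(article_id, fmt="json", encoding="unicode"):
    """Build a request URL for one article (hypothetical helper)."""
    if fmt not in ("json", "xml"):
        raise ValueError("fmt must be 'json' or 'xml'")
    if encoding not in ("unicode", "ascii"):
        raise ValueError("encoding must be 'unicode' or 'ascii'")
    return f"{BASE_URL}/BioC_{fmt}/{article_id}/{encoding}"


def fetch_bioc_json(article_id, timeout=30):
    """Download and parse one article as BioC JSON (needs network access)."""
    with urlopen(build_bioc_url(article_id), timeout=timeout) as response:
        return json.load(response)


def passage_texts(collection):
    """Return the text of every non-empty passage in a BioC collection dict."""
    return [
        passage["text"]
        for document in collection.get("documents", [])
        for passage in document.get("passages", [])
        if passage.get("text")
    ]


# Hand-written stand-in for a BioC JSON collection (illustrative only).
SAMPLE_COLLECTION = {
    "source": "example",
    "documents": [
        {
            "id": "PMC0000000",  # placeholder ID, not a real article
            "passages": [
                {"offset": 0, "infons": {"type": "title"},
                 "text": "An example title", "annotations": []},
                {"offset": 17, "infons": {"type": "abstract"},
                 "text": "An example abstract.", "annotations": []},
            ],
        }
    ],
}

for text in passage_texts(SAMPLE_COLLECTION):
    print(text)
```

For a focused collection, a caller would loop over a list of article IDs and pass each result of `fetch_bioc_json` to `passage_texts`. For the full corpus, the FTP bulk download mentioned above is likely the better fit than issuing one request per article.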

