Biomedical and Clinical English Model Packages in the Stanza Python NLP Library

07/29/2020
by   Yuhao Zhang, et al.
0

We introduce biomedical and clinical English model packages for the Stanza Python NLP library. These packages offer accurate syntactic analysis and named entity recognition capabilities for biomedical and clinical text, by combining Stanza's fully neural architecture with a wide variety of open datasets as well as large-scale unsupervised biomedical and clinical text data. We show via extensive experiments that our packages achieve syntactic analysis and named entity recognition performance that is on par with or surpasses state-of-the-art results. We further show that these models do not compromise speed compared to existing toolkits when GPU acceleration is available, and are made easy to download and use with Stanza's Python interface. A demonstration of our packages is available at: http://stanza.run/bio.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/27/2023

CamemBERT-bio: a Tasty French Language Model Better for your Health

Clinical data in hospitals are increasingly accessible for research thro...
research
11/12/2020

Biomedical Named Entity Recognition at Scale

Named entity recognition (NER) is a widely applicable natural language p...
research
09/22/2018

A Byte-sized Approach to Named Entity Recognition

In biomedical literature, it is common for entity boundaries to not alig...
research
12/07/2020

Improving Clinical Document Understanding on COVID-19 Research with Spark NLP

Following the global COVID-19 pandemic, the number of scientific papers ...
research
05/20/2020

BlaBla: Linguistic Feature Extraction for Clinical Analysis in Multiple Languages

We introduce BlaBla, an open-source Python library for extracting lingui...
research
12/25/2021

Deeper Clinical Document Understanding Using Relation Extraction

The surging amount of biomedical literature digital clinical records...
research
05/15/2023

Comparing Variation in Tokenizer Outputs Using a Series of Problematic and Challenging Biomedical Sentences

Background Objective: Biomedical text data are increasingly availabl...

Please sign up or login with your details

Forgot password? Click here to reset