Biomedical NER for the Enterprise with Distillated BERN2 and the Kazu Framework

12/01/2022
by   Wonjin Yoon, et al.
0

In order to assist the drug discovery/development process, pharmaceutical companies often apply biomedical NER and linking techniques over internal and public corpora. Decades of study of the field of BioNLP has produced a plethora of algorithms, systems and datasets. However, our experience has been that no single open source system meets all the requirements of a modern pharmaceutical company. In this work, we describe these requirements according to our experience of the industry, and present Kazu, a highly extensible, scalable open source framework designed to support BioNLP for the pharmaceutical sector. Kazu is a built around a computationally efficient version of the BERN2 NER model (TinyBERN2), and subsequently wraps several other BioNLP technologies into one coherent system. KAZU framework is open-sourced: https://github.com/AstraZeneca/KAZU

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/17/2020

HunFlair: An Easy-to-Use Tool for State-of-the-Art Biomedical Named Entity Recognition

Summary: Named Entity Recognition (NER) is an important step in biomedic...
research
01/29/2019

Revised JNLPBA Corpus: A Revised Version of Biomedical NER Corpus for Relation Extraction Task

The advancement of biomedical named entity recognition (BNER) and biomed...
research
07/17/2019

LinTO : Assistant vocal open-source respectueux des données personnelles pour les réunions d'entreprise

This paper presents the first results of the PIA "Grands Défis du Numéri...
research
10/24/2022

Enhancing Label Consistency on Document-level Named Entity Recognition

Named entity recognition (NER) is a fundamental part of extracting infor...
research
06/28/2022

NERDA-Con: Extending NER models for Continual Learning – Integrating Distinct Tasks and Updating Distribution Shifts

With increasing applications in areas such as biomedical information ext...
research
07/30/2018

Automating Requirements Traceability: Two Decades of Learning from KDD

This paper summarizes our experience with using Knowledge Discovery in D...
research
06/02/2022

Sparx: Distributed Outlier Detection at Scale

There is no shortage of outlier detection (OD) algorithms in the literat...

Please sign up or login with your details

Forgot password? Click here to reset