A transformer-based method for zero and few-shot biomedical named entity recognition

05/05/2023
by Miloš Košprdić, et al.

Supervised named entity recognition (NER) in the biomedical domain depends on large sets of texts annotated with the given named entities, which can be time-consuming and expensive to create. Furthermore, extracting new entities often requires additional annotation work and retraining of the model. To address these challenges, this paper proposes a transformer-based method for zero- and few-shot NER in the biomedical domain. The method transforms the task of multi-class token classification into binary token classification (a token either contains the searched entity or does not) and pre-trains on a larger number of datasets and biomedical entities, from which the method can learn semantic relations between the given and potential classes. We have achieved average F1 scores of 35.44 for 10-shot NER, and 79.51 entities with a fine-tuned PubMedBERT model. The results demonstrate the effectiveness of the proposed method for recognizing new entities from limited examples, with results comparable to or better than those of state-of-the-art zero- and few-shot NER methods.
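The reformulation from multi-class to binary token classification can be illustrated with a small sketch. This is not the authors' implementation: the function name `to_binary_examples`, the per-token label format, and the choice to emit one example per candidate class are assumptions made for illustration only. It shows how one multi-class annotated sentence could be turned into several (searched entity, tokens, 0/1 labels) examples.

```python
from typing import List, Tuple

# One binary example: (searched entity class, tokens, 0/1 label per token).
BinaryExample = Tuple[str, List[str], List[int]]


def to_binary_examples(
    tokens: List[str],
    labels: List[str],
    entity_classes: List[str],
) -> List[BinaryExample]:
    """Turn one multi-class example into one binary example per class.

    Hypothetical helper: a token gets label 1 if its multi-class label
    equals the searched entity class, and 0 otherwise.
    """
    if len(tokens) != len(labels):
        raise ValueError("tokens and labels must have the same length")
    examples = []
    for entity in entity_classes:
        binary = [1 if label == entity else 0 for label in labels]
        examples.append((entity, tokens, binary))
    return examples


# Toy sentence with made-up class names ("O" marks tokens outside any entity).
tokens = ["Aspirin", "reduces", "fever", "in", "adults"]
labels = ["Drug", "O", "Disease", "O", "O"]

examples = to_binary_examples(tokens, labels, ["Drug", "Disease", "Gene"])
for entity, _, binary in examples:
    print(entity, binary)
```

Because the searched class is part of the input rather than a fixed output label, a class with no positive tokens (here the made-up "Gene") simply yields an all-zero example, and an unseen class name can be queried at inference time without changing the output layer.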

research
05/05/2023

VicunaNER: Zero/Few-shot Named Entity Recognition using Vicuna

Large Language Models (LLMs, e.g., ChatGPT) have shown impressive zero- ...
research
03/29/2023

Zero-shot Clinical Entity Recognition using ChatGPT

In this study, we investigated the potential of ChatGPT, a large languag...
research
11/15/2021

Zero-Shot Learning in Named-Entity Recognition with External Knowledge

A significant shortcoming of current state-of-the-art (SOTA) named-entit...
research
05/27/2021

Neural Entity Recognition with Gazetteer based Fusion

Incorporating external knowledge into Named Entity Recognition (NER) sys...
research
09/28/2021

Template-free Prompt Tuning for Few-shot NER

Prompt-based methods have been successfully applied in sentence-level fe...
research
07/01/2022

Multi-features based Semantic Augmentation Networks for Named Entity Recognition in Threat Intelligence

Extracting cybersecurity entities such as attackers and vulnerabilities ...
research
06/07/2023

Evaluation of ChatGPT on Biomedical Tasks: A Zero-Shot Comparison with Fine-Tuned Generative Transformers

ChatGPT is a large language model developed by OpenAI. Despite its impre...
