Few-shot Named Entity Recognition with Cloze Questions

11/24/2021
by   Valerio La Gatta, et al.
0

Despite the huge and continuous advances in computational linguistics, the lack of annotated data for Named Entity Recognition (NER) is still a challenging issue, especially in low-resource languages and when domain knowledge is required for high-quality annotations. Recent findings in NLP show the effectiveness of cloze-style questions in enabling language models to leverage the knowledge they acquired during the pre-training phase. In our work, we propose a simple and intuitive adaptation of Pattern-Exploiting Training (PET), a recent approach which combines the cloze-questions mechanism and fine-tuning for few-shot learning: the key idea is to rephrase the NER task with patterns. Our approach achieves considerably better performance than standard fine-tuning and comparable or improved results with respect to other few-shot baselines without relying on manually annotated data or distant supervision on three benchmark datasets: NCBI-disease, BC2GM and a private Italian biomedical corpus.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/24/2022

Formulating Few-shot Fine-tuning Towards Language Model Pre-training: A Pilot Study on Named Entity Recognition

Fine-tuning pre-trained language models has recently become a common pra...
research
04/08/2023

MphayaNER: Named Entity Recognition for Tshivenda

Named Entity Recognition (NER) plays a vital role in various Natural Lan...
research
10/16/2020

Coarse-to-Fine Pre-training for Named Entity Recognition

More recently, Named Entity Recognition hasachieved great advances aided...
research
02/05/2022

Adaptive Fine-Tuning of Transformer-Based Language Models for Named Entity Recognition

The current standard approach for fine-tuning transformer-based language...
research
06/28/2022

NERDA-Con: Extending NER models for Continual Learning – Integrating Distinct Tasks and Updating Distribution Shifts

With increasing applications in areas such as biomedical information ext...
research
05/23/2023

BAND: Biomedical Alert News Dataset

Infectious disease outbreaks continue to pose a significant threat to hu...
research
09/29/2022

Named Entity Recognition in Industrial Tables using Tabular Language Models

Specialized transformer-based models for encoding tabular data have gain...

Please sign up or login with your details

Forgot password? Click here to reset