Learning In-context Learning for Named Entity Recognition

05/18/2023
βˆ™
by   Jiawei Chen, et al.
βˆ™
0
βˆ™

Named entity recognition in real-world applications suffers from the diversity of entity types, the emergence of new entity types, and the lack of high-quality annotations. To address the above problems, this paper proposes an in-context learning-based NER approach, which can effectively inject in-context NER ability into PLMs and recognize entities of novel types on-the-fly using only a few demonstrative instances. Specifically, we model PLMs as a meta-function Ξ»_ π’Ύπ“ƒπ“ˆπ“‰π“‡π“Šπ’Έπ“‰π’Ύβ„΄π“ƒ, π’Ήβ„―π“‚β„΄π“ƒπ“ˆπ“‰π“‡π’Άπ“‰π’Ύβ„΄π“ƒπ“ˆ, 𝓉ℯ𝓍𝓉. β„³, and a new entity extractor can be implicitly constructed by applying new instruction and demonstrations to PLMs, i.e., (Ξ» . β„³)(instruction, demonstrations) β†’ β„± where β„± will be a new entity extractor, i.e., β„±: text β†’ entities. To inject the above in-context NER ability into PLMs, we propose a meta-function pre-training algorithm, which pre-trains PLMs by comparing the (instruction, demonstration)-initialized extractor with a surrogate golden extractor. Experimental results on 4 few-shot NER datasets show that our method can effectively inject in-context NER ability into PLMs and significantly outperforms the PLMs+fine-tuning counterparts.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
βˆ™ 03/08/2022

InstructionNER: A Multi-Task Instruction-Based Generative Framework for Few-shot NER

Recently, prompt-based methods have achieved significant performance in ...
research
βˆ™ 08/07/2023

UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition

Large language models (LLMs) have demonstrated remarkable generalizabili...
research
βˆ™ 04/25/2020

A Rigourous Study on Named Entity Recognition: Can Fine-tuning Pretrained Model Lead to the Promised Land?

Fine-tuning pretrained model has achieved promising performance on stand...
research
βˆ™ 03/23/2022

Few-shot Named Entity Recognition with Self-describing Networks

Few-shot NER needs to effectively capture information from limited insta...
research
βˆ™ 03/13/2023

A Human Subject Study of Named Entity Recognition (NER) in Conversational Music Recommendation Queries

We conducted a human subject study of named entity recognition on a nois...
research
βˆ™ 03/29/2019

CUTIE: Learning to Understand Documents with Convolutional Universal Text Information Extractor

Extracting key information from documents, such as receipts or invoices,...
research
βˆ™ 10/10/2022

Learning "O" Helps for Learning More: Handling the Concealed Entity Problem for Class-incremental NER

As the categories of named entities rapidly increase in real-world appli...

Please sign up or login with your details

Forgot password? Click here to reset