Distilling Causal Effect from Miscellaneous Other-Class for Continual Named Entity Recognition

10/08/2022
by Junhao Zheng, et al.

Continual Learning for Named Entity Recognition (CL-NER) aims to learn a growing number of entity types over time from a stream of data. However, simply learning Other-Class in the same way as new entity types amplifies catastrophic forgetting and leads to a substantial performance drop. The main cause is that Other-Class samples usually contain old entity types, and the old knowledge in these samples is not properly preserved. Using causal inference, we identify that the forgetting is caused by the missing causal effect from the old data. To this end, we propose a unified causal framework to retrieve the causality from both new entity types and Other-Class. Furthermore, we apply curriculum learning to mitigate the impact of label noise and introduce a self-adaptive weight for balancing the causal effects between new entity types and Other-Class. Experimental results on three benchmark datasets show that our method outperforms the state-of-the-art method by a large margin. Moreover, our method can be combined with existing state-of-the-art methods to improve performance in CL-NER.
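
The observation that Other-Class samples can contain old entity types can be illustrated with a small sketch. It assumes the common CL-NER setting in which the data for each step is annotated only with the entity types introduced at that step, so every other token falls into Other-Class ("O"). The BIO tag scheme, the PER and LOC types, the example sentence, and the helper name `mask_to_current_step` are all hypothetical choices for illustration. The sketch shows only the data setting, not the proposed causal framework.

```python
def mask_to_current_step(tags, new_types):
    """Keep tags whose entity type is new at this step; relabel the rest as "O".

    Hypothetical helper: it mimics how step-wise annotation hides old types.
    """
    masked = []
    for tag in tags:
        # BIO tags look like "B-PER" or "I-PER"; "O" carries no entity type.
        entity_type = tag.split("-", 1)[1] if "-" in tag else None
        masked.append(tag if entity_type in new_types else "O")
    return masked


tokens = ["Alice", "visited", "Paris", "yesterday"]
full_tags = ["B-PER", "O", "B-LOC", "O"]

# Assume step 1 taught PER and step 2 introduces LOC.
# The step-2 annotation then marks only LOC entities.
step2_tags = mask_to_current_step(full_tags, new_types={"LOC"})

# Tokens labeled "O" at step 2 that actually belong to an old entity type.
hidden_old = [
    tok for tok, full, seen in zip(tokens, full_tags, step2_tags)
    if seen == "O" and full != "O"
]

print(step2_tags)
print(hidden_old)
```

In this toy case "Alice" is a PER entity but appears as Other-Class in the step-2 data, which is the kind of sample the abstract identifies as carrying old knowledge.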

research
03/02/2021

Distilling Causal Effect of Data in Class-Incremental Learning

We propose a causal framework to explain the catastrophic forgetting in ...
research
05/03/2023

Causal Interventions-based Few-Shot Named Entity Recognition

Few-shot named entity recognition (NER) systems aim at recognizing new ...
research
08/17/2023

Task Relation Distillation and Prototypical Pseudo Label for Incremental Named Entity Recognition

Incremental Named Entity Recognition (INER) involves the sequential lear...
research
10/10/2022

Learning "O" Helps for Learning More: Handling the Concealed Entity Problem for Class-incremental NER

As the categories of named entities rapidly increase in real-world appli...
research
02/23/2023

A Neural Span-Based Continual Named Entity Recognition Model

Named Entity Recognition (NER) models capable of Continual Learning (CL)...
research
05/29/2023

ContrastNER: Contrastive-based Prompt Tuning for Few-shot NER

Prompt-based language models have produced encouraging results in numero...
research
06/17/2021

De-biasing Distantly Supervised Named Entity Recognition via Causal Intervention

Distant supervision tackles the data bottleneck in NER by automatically ...
