LTP: A New Active Learning Strategy for Bert-CRF Based Named Entity Recognition

01/08/2020
by Mingyi Liu, et al.

In recent years, deep learning has achieved great success in many natural language processing tasks, including named entity recognition. Its shortcoming is that it usually requires a large amount of manually annotated data. Previous studies have demonstrated that transfer learning and active learning can each reduce the cost of data annotation, but there is still plenty of room for improvement. We assume that the two methods can complement each other, so that the model can be trained more accurately with less labelled data, and active learning can enhance transfer learning by accurately selecting the minimum set of data samples for iterative learning. However, in real applications we found this combination challenging, because traditional active learning strategies select samples based only on the final probability value of the model output, which makes it difficult to evaluate the quality of the selected data samples. In this paper, we first examine traditional active learning strategies in the specific case of BERT-CRF, which has been widely used in named entity recognition. We then propose an uncertainty-based active learning strategy called Lowest Token Probability (LTP), which considers not only the final output but also the intermediate results. We test LTP on multiple datasets, and the experiments show that LTP performs better than traditional strategies (including LC and NLC) on both token-level F_1 and sentence-level accuracy, especially on complex imbalanced datasets.
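The abstract does not give formulas for the strategies it compares, so the following is only a rough sketch of how such uncertainty scores might be computed. It assumes LC scores a sentence by one minus the probability of its most likely tag sequence, NLC by the length-normalized negative log of that probability, and LTP by one minus the lowest per-token probability along the most likely tag sequence. These definitions, the function names, and the way token probabilities are obtained are all assumptions made for illustration and are not taken from the text above.

```python
import math
from typing import List, Sequence


def least_confidence(seq_prob: float) -> float:
    """Assumed LC score: 1 - P(best tag sequence | sentence)."""
    return 1.0 - seq_prob


def normalized_least_confidence(seq_prob: float, length: int) -> float:
    """Assumed NLC score: negative log sequence probability divided by length.

    Dividing by length keeps long sentences from always looking most uncertain.
    """
    return -math.log(seq_prob) / length


def lowest_token_probability(token_probs: Sequence[float]) -> float:
    """Assumed LTP score: 1 - the smallest per-token probability.

    token_probs holds, for each token, the probability assigned to the tag
    chosen in the most likely tag sequence (a hypothetical input format).
    """
    return 1.0 - min(token_probs)


def select_for_annotation(scores: Sequence[float], k: int) -> List[int]:
    """Return indices of the k highest-scoring (most uncertain) sentences."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return ranked[:k]


# Toy pool: per-token probabilities for three unlabelled sentences.
pool = [
    [0.90, 0.95, 0.92],        # uniformly confident
    [0.99, 0.40, 0.98, 0.97],  # one very uncertain token
    [0.80, 0.85],              # mildly uncertain throughout
]
ltp_scores = [lowest_token_probability(p) for p in pool]
picked = select_for_annotation(ltp_scores, k=1)
```

In this toy pool the second sentence is picked, because a single low-probability token dominates its score even though the rest of its tokens are predicted confidently. A score based only on the whole-sequence probability would not isolate that token in the same way.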


