TURNER: The Uncertainty-based Retrieval Framework for Chinese NER

02/18/2022
by Zhichao Geng, et al.

Chinese NER is difficult because Chinese characters are ambiguous and the text has no explicit word boundaries. Previous work on Chinese NER focuses on lexicon-based methods that introduce boundary information and reduce out-of-vocabulary (OOV) cases during prediction. However, it is expensive to obtain and dynamically maintain high-quality lexicons in specific domains, which motivates us to utilize more general knowledge resources, e.g., search engines. In this paper, we propose TURNER: The Uncertainty-based Retrieval framework for Chinese NER. The idea behind TURNER is to imitate human behavior: people frequently retrieve auxiliary knowledge for assistance when they encounter an unknown or uncertain entity. To improve the efficiency and effectiveness of retrieval, we first propose two types of uncertainty sampling methods that select the most ambiguous entity-level components of the input text. Then, the Knowledge Fusion Model re-predicts the uncertain samples by combining them with the retrieved knowledge. Experiments on four benchmark datasets demonstrate TURNER's effectiveness: it outperforms existing lexicon-based approaches and achieves new state-of-the-art (SOTA) results.
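
The selection step described above can be illustrated with a minimal sketch. The abstract does not specify the two uncertainty sampling methods, so this example assumes a generic entropy-based criterion: a character counts as uncertain when the entropy of its predicted label distribution exceeds a threshold, and adjacent uncertain characters are merged into one span that could then be sent to a retriever. The function names, the threshold, and the probabilities are all hypothetical and are not taken from the paper.

```python
import math
from typing import List, Sequence, Tuple


def token_entropy(probs: Sequence[float]) -> float:
    """Shannon entropy (in nats) of one character's label distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def uncertain_spans(
    tokens: Sequence[str],
    label_probs: Sequence[Sequence[float]],
    threshold: float,
) -> List[Tuple[int, int, str]]:
    """Return (start, end, text) for each maximal run of characters whose
    entropy exceeds `threshold`. `end` is exclusive.

    Assumption: entropy thresholding stands in for the paper's
    uncertainty sampling methods, which the abstract does not detail.
    """
    spans: List[Tuple[int, int, str]] = []
    start = None
    for i, probs in enumerate(label_probs):
        if token_entropy(probs) > threshold:
            if start is None:
                start = i  # open a new uncertain span
        elif start is not None:
            spans.append((start, i, "".join(tokens[start:i])))
            start = None
    if start is not None:  # close a span that runs to the end of the input
        spans.append((start, len(tokens), "".join(tokens[start:])))
    return spans


# Illustrative input: made-up probabilities over three labels per character.
tokens = list("南京市长江大桥")
confident = [0.9, 0.05, 0.05]
label_probs = [
    confident,
    confident,
    [0.4, 0.3, 0.3],     # ambiguous
    [0.34, 0.33, 0.33],  # ambiguous
    confident,
    confident,
    confident,
]
spans = uncertain_spans(tokens, label_probs, threshold=0.8)
```

In this sketch only the flagged spans would be used as retrieval queries, which reflects the efficiency motivation stated in the abstract: confident predictions are left alone and retrieval is reserved for the ambiguous parts of the input.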

research
04/14/2020

Incorporating Uncertain Segmentation Information into Chinese NER for Social Media Text

Chinese word segmentation is necessary to provide word-level information...
research
10/23/2022

Improving Chinese Named Entity Recognition by Search Engine Augmentation

Compared with English, Chinese suffers from more grammatical ambiguities...
research
01/15/2020

FGN: Fusion Glyph Network for Chinese Named Entity Recognition

Chinese NER is a challenging task. As pictographs, Chinese characters co...
research
06/27/2023

DMNER: Biomedical Entity Recognition by Detection and Matching

Biomedical named entity recognition (BNER) serves as the foundation for ...
research
05/05/2018

Chinese NER Using Lattice LSTM

We investigate a lattice-structured LSTM model for Chinese NER, which en...
research
07/16/2020

SLK-NER: Exploiting Second-order Lexicon Knowledge for Chinese NER

Although character-based models using lexicon have achieved promising re...
research
08/27/2022

Domain-Specific NER via Retrieving Correlated Samples

Successful Machine Learning based Named Entity Recognition models could ...
