Adversarial Learning for Chinese NER from Crowd Annotations

01/16/2018
by   YaoSheng Yang, et al.
0

To quickly obtain new labeled data, we can choose crowdsourcing as an alternative way at lower cost in a short time. But as an exchange, crowd annotations from non-experts may be of lower quality than those from experts. In this paper, we propose an approach to performing crowd annotation learning for Chinese Named Entity Recognition (NER) to make full use of the noisy sequence labels from multiple annotators. Inspired by adversarial learning, our approach uses a common Bi-LSTM and a private Bi-LSTM for representing annotator-generic and -specific information. The annotator-generic information is the common knowledge for entities easily mastered by the crowd. Finally, we build our Chinese NE tagger based on the LSTM-CRF model. In our experiments, we create two data sets for Chinese NER tasks from two domains. The experimental results show that our system achieves better scores than strong baseline systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/28/2019

Exploiting Multiple Embeddings for Chinese Named Entity Recognition

Identifying the named entities mentioned in text would enrich many seman...
research
05/31/2020

Recognizing Chinese Judicial Named Entity using BiLSTM-CRF

Named entity recognition (NER) plays an essential role in natural langua...
research
01/15/2020

FGN: Fusion Glyph Network for Chinese Named Entity Recognition

Chinese NER is a challenging task. As pictographs, Chinese characters co...
research
09/11/2021

AdaK-NER: An Adaptive Top-K Approach for Named Entity Recognition with Incomplete Annotations

State-of-the-art Named Entity Recognition(NER) models rely heavily on la...
research
08/16/2019

Simplify the Usage of Lexicon in Chinese NER

Recently, many works have tried to utilizing word lexicon to augment the...
research
10/22/2019

IPOD: Corpus of 190,000 Industrial Occupations

Job titles are the most fundamental building blocks for occupational dat...
research
04/22/2022

Identifying Chinese Opinion Expressions with Extremely-Noisy Crowdsourcing Annotations

Recent works of opinion expression identification (OEI) rely heavily on ...

Please sign up or login with your details

Forgot password? Click here to reset