Adaptive Name Entity Recognition under Highly Unbalanced Data

03/10/2020
by Thong Nguyen, et al.

Named Entity Recognition (NER) plays an important role in many Natural Language Processing (NLP) applications, such as Information Extraction, Sentiment Analysis and chatbots, because it identifies entities in text and assigns them to predefined groups such as the names of persons, locations, organizations, quantities or percentages. In this report, we present our experiments on a neural architecture composed of a Conditional Random Field (CRF) layer stacked on top of a bidirectional LSTM (Bi-LSTM) layer for solving NER tasks. We also employ a fused input of embedding vectors (GloVe, BERT), which are pre-trained on huge corpora, to boost the generalization capacity of the model. Unfortunately, because the class distribution across the training data is heavily unbalanced, both approaches performed poorly on the classes with few training samples. To overcome this challenge, we introduce an add-on classification model that splits sentences into two sets, Weak and Strong classes, and then design a pair of Bi-LSTM-CRF models, each tuned to optimize performance on one set. We evaluated our models on the test set and found that our method can significantly improve performance for the Weak classes while using a very small data set (approximately 0.45%) compared to the remaining classes.
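The Weak/Strong split described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract says an add-on classification model routes sentences, whereas this sketch only shows one plausible way to derive the two training sets from gold BIO tags. The function names (`find_weak_classes`, `split_weak_strong`), the `max_share` threshold, and the toy sentences are all assumptions made for illustration.

```python
from collections import Counter


def entity_type(tag):
    """Strip a BIO prefix: 'B-PER' -> 'PER'; the outside tag 'O' -> None."""
    if tag == "O":
        return None
    return tag.split("-", 1)[-1]


def find_weak_classes(tagged_sentences, max_share=0.2):
    """Return entity types whose share of all entity mentions is <= max_share.

    max_share is a hypothetical threshold, not a value from the paper.
    Mentions are counted by their 'B-' tags, so BIO tagging is assumed.
    """
    counts = Counter()
    for sentence in tagged_sentences:
        for _, tag in sentence:
            if tag.startswith("B-"):
                counts[entity_type(tag)] += 1
    total = sum(counts.values())
    if total == 0:
        return set()
    return {cls for cls, n in counts.items() if n / total <= max_share}


def split_weak_strong(tagged_sentences, weak_classes):
    """Put a sentence in the Weak set if it contains any weak-class entity."""
    weak, strong = [], []
    for sentence in tagged_sentences:
        types = {entity_type(tag) for _, tag in sentence}
        if types & weak_classes:
            weak.append(sentence)
        else:
            strong.append(sentence)
    return weak, strong


# Toy corpus (invented): PER and LOC are frequent, PCT is rare.
sentences = [
    [("Alice", "B-PER"), ("visited", "O"), ("Paris", "B-LOC")],
    [("Bob", "B-PER"), ("met", "O"), ("Carol", "B-PER")],
    [("Growth", "O"), ("hit", "O"), ("5", "B-PCT"), ("%", "I-PCT")],
    [("Dave", "B-PER"), ("left", "O"), ("Rome", "B-LOC")],
]

weak_classes = find_weak_classes(sentences, max_share=0.2)
weak_set, strong_set = split_weak_strong(sentences, weak_classes)
```

Under this assumption, each set would then be used to train its own Bi-LSTM-CRF tagger, and a sentence-level classifier trained on the same Weak/Strong labels would pick which tagger to apply at inference time.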

research
09/27/2017

Application of a Hybrid Bi-LSTM-CRF model to the task of Russian Named Entity Recognition

Named Entity Recognition (NER) is one of the most common tasks of the na...
research
09/23/2019

Portuguese Named Entity Recognition using BERT-CRF

Recent advances in language representation using neural networks have ma...
research
09/24/2018

Deformable Stacked Structure for Named Entity Recognition

Neural architecture for named entity recognition has achieved great succ...
research
02/07/2017

Fast and Accurate Entity Recognition with Iterated Dilated Convolutions

Today when many practitioners run basic NLP on the entire web and large-...
research
03/12/2019

Syllable-based Neural Named Entity Recognition for Myanmar Language

Named Entity Recognition (NER) for Myanmar Language is essential to Myan...
research
05/31/2020

Recognizing Chinese Judicial Named Entity using BiLSTM-CRF

Named entity recognition (NER) plays an essential role in natural langua...
research
01/05/2020

Computationally Efficient NER Taggers with Combined Embeddings and Constrained Decoding

Current State-of-the-Art models in Named Entity Recognition (NER) are ne...
