Portuguese Named Entity Recognition using BERT-CRF

09/23/2019
by   Fábio Souza, et al.
0

Recent advances in language representation using neural networks have made it viable to transfer the learned internal states of a trained model to downstream natural language processing tasks, such as named entity recognition (NER) and question answering. It has been shown that the leverage of pre-trained language models improves the overall performance on many tasks and is highly beneficial when labeled data is scarce. In this work, we employ a pre-trained BERT with Conditional Random Fields (CRF) architecture to the NER task on the Portuguese language, combining the transfer capabilities of BERT with the structured predictions of CRF. We explore feature-based and fine-tuning training strategies for the BERT model. Our fine-tuning approach obtains new state-of-the-art results on the HAREM I dataset, improving the F1-score by 3.2 points on the selective scenario (5 NE classes) and by 3.8 points on the total scenario (10 NE classes).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/19/2020

Beheshti-NER: Persian Named Entity Recognition Using BERT

Named entity recognition is a natural language processing task to recogn...
research
06/14/2021

Can BERT Dig It? – Named Entity Recognition for Information Retrieval in the Archaeology Domain

The amount of archaeological literature is growing rapidly. Until recent...
research
02/26/2020

Detecting Potential Topics In News Using BERT, CRF and Wikipedia

For a news content distribution platform like Dailyhunt, Named Entity Re...
research
03/10/2020

Adaptive Name Entity Recognition under Highly Unbalanced Data

For several purposes in Natural Language Processing (NLP), such as Infor...
research
11/30/2021

Automatic Extraction of Medication Names in Tweets as Named Entity Recognition

Social media posts contain potentially valuable information about medica...
research
11/13/2020

FLERT: Document-Level Features for Named Entity Recognition

Current state-of-the-art approaches for named entity recognition (NER) u...
research
05/28/2021

Weighted Training for Cross-Task Learning

In this paper, we introduce Target-Aware Weighted Training (TAWT), a wei...

Please sign up or login with your details

Forgot password? Click here to reset