Resource-Size matters: Improving Neural Named Entity Recognition with Optimized Large Corpora

07/26/2018
by   Sajawel Ahmed, et al.
0

This study improves the performance of neural named entity recognition by a margin of up to 11 German, thereby outperforming existing baselines and establishing a new state-of-the-art on each single open-source dataset. Rather than designing deeper and wider hybrid neural architectures, we gather all available resources and perform a detailed optimization and grammar-dependent morphological processing consisting of lemmatization and part-of-speech tagging prior to exposing the raw data to any training process. We test our approach in a threefold monolingual experimental setup of a) single, b) joint, and c) optimized training and shed light on the dependency of downstream-tasks on the size of corpora used to compute word embeddings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/01/2017

Morphological Embeddings for Named Entity Recognition in Morphologically Rich Languages

In this work, we present new state-of-the-art results of 93.59, for Turk...
research
06/29/2020

Improving Sequence Tagging for Vietnamese Text Using Transformer-based Neural Models

This paper describes our study on using mutilingual BERT embeddings and ...
research
08/27/2019

A Morpho-Syntactically Informed LSTM-CRF Model for Named Entity Recognition

We propose a morphologically informed model for named entity recognition...
research
01/28/2022

Towards a Broad Coverage Named Entity Resource: A Data-Efficient Approach for Many Diverse Languages

Parallel corpora are ideal for extracting a multilingual named entity (M...
research
03/18/2020

Distant Supervision and Noisy Label Learning for Low Resource Named Entity Recognition: A Study on Hausa and Yorùbá

The lack of labeled training data has limited the development of natural...
research
03/21/2022

Neural Token Segmentation for High Token-Internal Complexity

Tokenizing raw texts into word units is an essential pre-processing step...

Please sign up or login with your details

Forgot password? Click here to reset