reproducing "ner and pos when nothing is capitalized"

09/17/2021
by Andreas Kuster, et al.

Capitalization is an important feature in many NLP tasks such as Named Entity Recognition (NER) or Part of Speech Tagging (POS). We attempt to reproduce the results of a paper that shows how to mitigate the significant performance drop that occurs when casing is mismatched between training and test data. In particular, we show that lowercasing 50% of the dataset provides the best performance, matching the claims of the original paper. However, we obtained slightly lower performance in almost all of the experiments we tried to reproduce, suggesting that hidden factors may be affecting our results. Lastly, we make all of our work available in a public GitHub repository.
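
As a rough illustration of the lowercasing idea mentioned in the abstract, the sketch below lowercases the tokens of a randomly chosen half of the training sentences while leaving the tags untouched. The function name `lowercase_fraction`, the (token, tag) data layout, and per-sentence random sampling are all assumptions made for this example; the abstract does not say how the lowercased portion was selected, so this should not be read as the authors' implementation.

```python
import random
from typing import List, Tuple

# Assumed data layout: a sentence is a list of (token, tag) pairs.
Sentence = List[Tuple[str, str]]


def lowercase_fraction(
    sentences: List[Sentence], fraction: float = 0.5, seed: int = 0
) -> List[Sentence]:
    """Return a copy of `sentences` in which roughly `fraction` of the
    sentences (chosen at random) have all tokens lowercased.

    Tags are never modified; only the surface form of the tokens changes.
    """
    rng = random.Random(seed)  # local generator, so results are repeatable
    n_lower = round(len(sentences) * fraction)
    chosen = set(rng.sample(range(len(sentences)), n_lower))
    return [
        [(tok.lower(), tag) for tok, tag in sent] if i in chosen else list(sent)
        for i, sent in enumerate(sentences)
    ]


if __name__ == "__main__":
    train = [
        [("Paris", "B-LOC"), ("is", "O"), ("nice", "O")],
        [("Anna", "B-PER"), ("left", "O")],
    ]
    for sent in lowercase_fraction(train, fraction=0.5, seed=0):
        print(sent)
```

Sampling whole sentences keeps each training example internally consistent in its casing; other readings of "lowercasing 50%" (for example, adding a lowercased copy of every sentence) are equally plausible and would need a different function.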


