HAMNER: Headword Amplified Multi-span Distantly Supervised Method for Domain Specific Named Entity Recognition

12/03/2019
by   Shifeng Liu, et al.

Supervised methods for Named Entity Recognition (NER) require large amounts of cleanly annotated data, which is labor-intensive and time-consuming to obtain. Distantly supervised methods, in contrast, alleviate this requirement by using dictionaries to annotate data automatically. Unfortunately, the limited coverage of dictionaries hinders the effectiveness of distantly supervised NER, especially in specific domains. In this paper, we address two limitations: dictionary usage and mention boundary detection. We generalize distant supervision by extending the dictionary with headword-based non-exact matching, and we apply a function to better weight the matched entity mentions. We propose a span-level model that classifies all possible spans and then infers the selected spans with a proposed dynamic programming algorithm. Experiments on all three benchmark datasets demonstrate that our method outperforms previous state-of-the-art distantly supervised methods.
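The abstract does not spell out the matching rule, the weighting function, or the dynamic programming algorithm, so the following is only a minimal sketch of the two ideas under stated assumptions, not the authors' implementation. It assumes the headword of a phrase is its last token (a crude stand-in for a syntactic head), uses a hypothetical weight of 0.5 for headword-only matches, and selects non-overlapping spans by maximizing the total score, as in standard weighted interval scheduling. The names `headword_match` and `select_spans` are illustrative.

```python
from typing import Dict, List, Optional, Tuple

# (start, end_exclusive, label, score); token offsets into one sentence.
Span = Tuple[int, int, str, float]


def headword_match(tokens: List[str],
                   dictionary: Dict[str, str]) -> Optional[Tuple[str, float]]:
    """Return (label, weight) for a phrase, or None if nothing matches.

    Assumption: the headword is the last token of the phrase.
    Assumption: exact matches get weight 1.0, headword-only matches 0.5.
    """
    if not tokens:
        return None
    phrase = " ".join(tokens).lower()
    if phrase in dictionary:
        return dictionary[phrase], 1.0
    # Index dictionary entries by their (assumed) headword.
    heads = {entry.split()[-1]: label for entry, label in dictionary.items()}
    head = tokens[-1].lower()
    if head in heads:
        return heads[head], 0.5
    return None


def select_spans(n_tokens: int, candidates: List[Span]) -> List[Span]:
    """Pick non-overlapping spans with maximum total score.

    best[i] holds the best total score achievable over tokens [0, i).
    """
    by_end: Dict[int, List[Span]] = {}
    for span in candidates:
        by_end.setdefault(span[1], []).append(span)

    best = [0.0] * (n_tokens + 1)
    back: List[Optional[Span]] = [None] * (n_tokens + 1)
    for i in range(1, n_tokens + 1):
        best[i] = best[i - 1]  # option: token i-1 is outside every span
        for span in by_end.get(i, []):
            total = best[span[0]] + span[3]
            if total > best[i]:
                best[i], back[i] = total, span

    # Walk the back-pointers from the end of the sentence.
    chosen: List[Span] = []
    i = n_tokens
    while i > 0:
        span = back[i]
        if span is None:
            i -= 1
        else:
            chosen.append(span)
            i = span[0]
    chosen.reverse()
    return chosen
```

In this sketch, a span classifier would supply the candidate scores; overlapping candidates such as a full mention and one of its sub-spans compete, and only the combination with the highest total score survives.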


research
02/22/2023

FiNER: Financial Named Entity Recognition Dataset and Weak-Supervision Model

The development of annotated datasets over the 21st century has helped u...
research
06/17/2021

De-biasing Distantly Supervised Named Entity Recognition via Causal Intervention

Distant supervision tackles the data bottleneck in NER by automatically ...
research
09/10/2018

Learning Named Entity Tagger using Domain-Specific Dictionary

Recent advances in deep neural models allow us to build reliable named e...
research
06/04/2019

Distantly Supervised Named Entity Recognition using Positive-Unlabeled Learning

In this work, we explore the way to perform named entity recognition (NE...
research
05/22/2023

Biomedical Named Entity Recognition via Dictionary-based Synonym Generalization

Biomedical named entity recognition is one of the core tasks in biomedic...
research
10/14/2022

Automatic Creation of Named Entity Recognition Datasets by Querying Phrase Representations

Most weakly supervised named entity recognition (NER) models rely on dom...
research
05/06/2023

SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recognition

Distantly-Supervised Named Entity Recognition effectively alleviates the...
