Constrained Decoding for Computationally Efficient Named Entity Recognition Taggers

10/09/2020
by Brian Lester, et al.

Current state-of-the-art models for named entity recognition (NER) are neural models with a conditional random field (CRF) as the final layer. Entities are represented as per-token labels with a special structure so that they can be decoded into spans. Current work eschews prior knowledge of how the span encoding scheme works and instead relies on the CRF to learn which transitions are legal and which are illegal in order to facilitate global coherence. We find that, by constraining the output to suppress illegal transitions, we can train a tagger with a cross-entropy loss twice as fast as a CRF, with differences in F1 that are statistically insignificant, effectively eliminating the need for a CRF. We analyze the dynamics of tag co-occurrence to explain when these constraints are most effective and provide open source implementations of our tagger in both PyTorch and TensorFlow.
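The idea of suppressing illegal transitions at decoding time can be illustrated with a minimal sketch. It is not the authors' released implementation. It assumes a BIO span encoding (the abstract does not name a scheme), uses NumPy instead of PyTorch or TensorFlow, and all function names are hypothetical. Per-token scores stand in for the output of a tagger trained with cross-entropy, and the transition matrix is fixed by the encoding rules instead of being learned.

```python
import numpy as np

def split_tag(tag):
    """Split 'B-PER' into ('B', 'PER'); 'O' becomes ('O', None)."""
    if tag == "O":
        return "O", None
    prefix, _, etype = tag.partition("-")
    return prefix, etype

def is_legal_bio(prev, curr):
    """Assumed BIO rule: I-X may only follow B-X or I-X."""
    c_prefix, c_type = split_tag(curr)
    if c_prefix != "I":
        return True
    p_prefix, p_type = split_tag(prev)
    return p_prefix in ("B", "I") and p_type == c_type

def transition_mask(tags):
    """mask[i, j] is 0.0 if tags[i] -> tags[j] is legal, else -inf."""
    n = len(tags)
    mask = np.zeros((n, n))
    for i, prev in enumerate(tags):
        for j, curr in enumerate(tags):
            if not is_legal_bio(prev, curr):
                mask[i, j] = -np.inf
    return mask

def constrained_decode(scores, tags):
    """Viterbi search over per-token scores with fixed, unlearned transitions."""
    mask = transition_mask(tags)
    # A sequence may not start with an I- tag.
    start = np.array([-np.inf if split_tag(t)[0] == "I" else 0.0 for t in tags])
    best = scores[0] + start
    backptr = []
    for step in scores[1:]:
        # candidate[i, j]: best path ending in tag i, followed by tag j.
        candidate = best[:, None] + mask + step[None, :]
        backptr.append(candidate.argmax(axis=0))
        best = candidate.max(axis=0)
    # Follow back-pointers from the best final tag to recover the path.
    path = [int(best.argmax())]
    for ptr in reversed(backptr):
        path.append(int(ptr[path[-1]]))
    return [tags[i] for i in reversed(path)]

tags = ["O", "B-PER", "I-PER", "B-LOC", "I-LOC"]
scores = np.array([
    [0.0, 2.0, 0.0, 0.0, 0.0],  # token 1 prefers B-PER
    [0.0, 0.0, 1.0, 0.0, 1.5],  # token 2 prefers I-LOC, illegal after B-PER
    [2.0, 0.0, 0.0, 0.0, 0.0],  # token 3 prefers O
])
greedy = [tags[i] for i in scores.argmax(axis=1)]
constrained = constrained_decode(scores, tags)
```

In this toy input, independent per-token argmax yields the incoherent sequence B-PER, I-LOC, O, while the constrained search returns B-PER, I-PER, O. Because the mask has no trainable parameters, it adds no cost to training, which is consistent with the speedup the abstract reports over a CRF.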


research
01/05/2020

Computationally Efficient NER Taggers with Combined Embeddings and Constrained Decoding

Current State-of-the-Art models in Named Entity Recognition (NER) are ne...
research
08/13/2018

Confidence penalty, annealing Gaussian noise and zoneout for biLSTM-CRF networks for named entity recognition

Named entity recognition (NER) is used to identify relevant entities in ...
research
05/26/2018

Connecting Distant Entities with Induction through Conditional Random Fields for Named Entity Recognition: Precursor-Induced CRF

This paper presents a method of designing specific high-order dependency...
research
05/29/2017

Supervised Complementary Entity Recognition with Augmented Key-value Pairs of Knowledge

Extracting opinion targets is an important task in sentiment analysis on...
research
04/08/2020

Self-Attention Gazetteer Embeddings for Named-Entity Recognition

Recent attempts to ingest external knowledge into neural models for name...
research
09/11/2020

Investigating Bi-LSTM and CRF with POS Tag Embedding for Indonesian Named Entity Tagger

Researches on Indonesian named entity (NE) tagger have been conducted si...
research
08/27/2018

Fast and Accurate Recognition of Chinese Clinical Named Entities with Residual Dilated Convolutions

Clinical Named Entity Recognition (CNER) aims to identify and classify c...
