Confidence penalty, annealing Gaussian noise and zoneout for biLSTM-CRF networks for named entity recognition

08/13/2018
by   Antonio Jimeno Yenes, et al.
0

Named entity recognition (NER) is used to identify relevant entities in text. A bidirectional LSTM (long short term memory) encoder with a neural conditional random fields (CRF) decoder (biLSTM-CRF) is the state of the art methodology. In this work, we have done an analysis of several methods that intend to optimize the performance of networks based on this architecture, which in some cases encourage overfitting avoidance. These methods target exploration of parameter space, regularization of LSTMs and penalization of confident output distributions. Results show that the optimization methods improve the performance of the biLSTM-CRF NER baseline system, setting a new state of the art performance for the CoNLL-2003 Spanish set with an F1 of 87.18.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/26/2018

Combining neural and knowledge-based approaches to Named Entity Recognition in Polish

Named entity recognition (NER) is one of the tasks in natural language p...
research
05/26/2018

Connecting Distant Entities with Induction through Conditional Random Fields for Named Entity Recognition: Precursor-Induced CRF

This paper presents a method of designing specific high-order dependency...
research
12/20/2020

A hybrid deep-learning approach for complex biochemical named entity recognition

Named entity recognition (NER) of chemicals and drugs is a critical doma...
research
10/09/2020

Constrained Decoding for Computationally Efficient Named Entity Recognition Taggers

Current state-of-the-art models for named entity recognition (NER) are n...
research
11/19/2020

Persuasive Dialogue Understanding: the Baselines and Negative Results

Persuasion aims at forming one's opinion and action via a series of pers...
research
07/17/2018

Bench-Marking Information Extraction in Semi-Structured Historical Handwritten Records

In this report, we present our findings from benchmarking experiments fo...
research
05/18/2018

Suffix Bidirectional Long Short-Term Memory

Recurrent neural networks have become ubiquitous in computing representa...

Please sign up or login with your details

Forgot password? Click here to reset