A Deep Learning approach for Hindi Named Entity Recognition

11/05/2019
by Bansi Shah, et al.

Named Entity Recognition (NER) is one of the most important text-processing requirements in many NLP tasks. In this paper, we use a deep architecture to recognize named entities in a given Hindi sentence. Bidirectional Long Short-Term Memory (BiLSTM) based techniques have been used for the NER task in the literature. We first tune a BiLSTM in a low-resource scenario to work for Hindi NER, and then propose two enhancements, namely (a) a de-noising auto-encoder (DAE) LSTM and (b) a conditioning LSTM, which show improvement in the NER task compared to the BiLSTM approach. We use pre-trained word embeddings to represent the words in the corpus, and the NER tags of the words are those defined by the annotated corpora used. Experiments have been performed to analyze the performance of different word embeddings and batch sizes, which is essential for training deep models.
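The abstract does not say how the de-noising auto-encoder (DAE) LSTM corrupts its input, so the following is only a minimal sketch of one common choice: replacing tokens with a placeholder at random and pairing each noisy sentence with its clean original as the reconstruction target. The names `UNK`, `corrupt`, `make_dae_pairs`, and the `drop_prob` parameter are hypothetical and not taken from the paper, and the Hindi sentences are made-up examples.

```python
import random
from typing import List, Tuple

# Hypothetical placeholder token; the paper's actual noise model is not given.
UNK = "<unk>"


def corrupt(tokens: List[str], drop_prob: float, rng: random.Random) -> List[str]:
    """Replace each token with UNK independently with probability drop_prob."""
    return [UNK if rng.random() < drop_prob else tok for tok in tokens]


def make_dae_pairs(
    sentences: List[List[str]], drop_prob: float = 0.2, seed: int = 0
) -> List[Tuple[List[str], List[str]]]:
    """Build (noisy input, clean target) pairs for de-noising training."""
    rng = random.Random(seed)  # seeded so the corruption is reproducible
    return [(corrupt(sent, drop_prob, rng), list(sent)) for sent in sentences]


# Made-up tokenized Hindi sentences, used purely for illustration.
sentences = [
    ["राम", "दिल्ली", "गया"],
    ["सीता", "मुंबई", "में", "रहती", "है"],
]

pairs = make_dae_pairs(sentences, drop_prob=0.2, seed=0)
for noisy, clean in pairs:
    print(noisy, "->", clean)
```

A de-noising model would then be trained to reconstruct the clean sequence from the noisy one; how that reconstruction objective is combined with the NER tagging loss is described in the full paper rather than in this abstract.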

research
05/29/2017

The Importance of Automatic Syntactic Features in Vietnamese Named Entity Recognition

This paper presents a state-of-the-art system for Vietnamese Named Entit...
research
11/10/2019

TENER: Adapting Transformer Encoder for Named Entity Recognition

The Bidirectional long short-term memory networks (BiLSTM) have been wid...
research
09/02/2020

ASTRAL: Adversarial Trained LSTM-CNN for Named Entity Recognition

Named Entity Recognition (NER) is a challenging task that extracts named...
research
09/27/2016

Modelling Radiological Language with Bidirectional Long Short-Term Memory Networks

Motivated by the need to automate medical information extraction from fr...
research
06/07/2018

Embedding Transfer for Low-Resource Medical Named Entity Recognition: A Case Study on Patient Mobility

Functioning is gaining recognition as an important indicator of global h...
research
10/06/2016

A New Data Representation Based on Training Data Characteristics to Extract Drug Named-Entity in Medical Text

One essential task in information extraction from the medical corpus is ...
