A Feature-Rich Vietnamese Named-Entity Recognition Model

03/12/2018
by   Pham Quang Nhat Minh, et al.
0

In this paper, we present a feature-based named-entity recognition (NER) model that achieves the start-of-the-art accuracy for Vietnamese language. We combine word, word-shape features, PoS, chunk, Brown-cluster-based features, and word-embedding-based features in the Conditional Random Fields (CRF) model. We also explore the effects of word segmentation, PoS tagging, and chunking results of many popular Vietnamese NLP toolkits on the accuracy of the proposed feature-based NER model. Up to now, our work is the first work that systematically performs an extrinsic evaluation of basic Vietnamese NLP toolkits on the downstream NER task. Experimental results show that while automatically-generated word segmentation is useful, PoS and chunking information generated by Vietnamese NLP tools does not show their benefits for the proposed feature-based NER model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/03/2021

Empirical Study of Named Entity Recognition Performance Using Distribution-aware Word Embedding

With the fast development of Deep Learning techniques, Named Entity Reco...
research
03/22/2018

A Feature-Based Model for Nested Named-Entity Recognition at VLSP-2018 NER Evaluation Campaign

In this report, we describe our participant named-entity recognition sys...
research
08/11/2017

Unified Neural Architecture for Drug, Disease and Clinical Entity Recognition

Most existing methods for biomedical entity recognition task rely on exp...
research
04/19/2016

Exploring Segment Representations for Neural Segmentation Models

Many natural language processing (NLP) tasks can be generalized into seg...
research
09/17/2021

reproducing "ner and pos when nothing is capitalized"

Capitalization is an important feature in many NLP tasks such as Named E...
research
04/16/2018

Arabic Named Entity Recognition using Word Representations

Recent work has shown the effectiveness of the word representations feat...
research
05/10/2018

Hybrid semi-Markov CRF for Neural Sequence Labeling

This paper proposes hybrid semi-Markov conditional random fields (SCRFs)...

Please sign up or login with your details

Forgot password? Click here to reset