Named Entity Analysis and Extraction with Uncommon Words

10/16/2018
by   Xiaoshi Zhong, et al.
0

Most previous research treats named entity extraction and classification as an end-to-end task. We argue that the two sub-tasks should be addressed separately. Entity extraction lies at the level of syntactic analysis while entity classification lies at the level of semantic analysis. According to Noam Chomsky's "Syntactic Structures," pp. 93-94 (Chomsky 1957), syntax is not appealed to semantics and semantics does not affect syntax. We analyze two benchmark datasets for the characteristics of named entities, finding that uncommon words can distinguish named entities from common text; where uncommon words are the words that hardly appear in common text and they are mainly the proper nouns. Experiments validate that lexical and syntactic features achieve state-of-the-art performance on entity extraction and that semantic features do not further improve the extraction performance, in both of our model and the state-of-the-art baselines. With Chomsky's view, we also explain the failure of joint syntactic and semantic parsings in other works.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/11/2019

Dependency-Aware Named Entity Recognition with Relative and Global Attentions

Named entity recognition is one of the core tasks in NLP. Although many ...
research
04/08/2015

Exploring Lexical, Syntactic, and Semantic Features for Chinese Textual Entailment in NTCIR RITE Evaluation Tasks

We computed linguistic information at the lexical, syntactic, and semant...
research
03/15/2017

Sparse Named Entity Classification using Factorization Machines

Named entity classification is the task of classifying text-based elemen...
research
11/07/2015

The Goldilocks Principle: Reading Children's Books with Explicit Memory Representations

We introduce a new test of how well language models capture meaning in c...
research
01/06/2016

Joint Learning of the Embedding of Words and Entities for Named Entity Disambiguation

Named Entity Disambiguation (NED) refers to the task of resolving multip...
research
01/28/2022

Boosting Entity Mention Detection for Targetted Twitter Streams with Global Contextual Embeddings

Microblogging sites, like Twitter, have emerged as ubiquitous sources of...
research
04/04/2021

ASPER: Attention-based Approach to Extract Syntactic Patterns denoting Semantic Relations in Sentential Context

Semantic relationships, such as hyponym-hypernym, cause-effect, meronym-...

Please sign up or login with your details

Forgot password? Click here to reset