Concept Identification of Directly and Indirectly Related Mentions Referring to Groups of Persons

07/02/2021
by   Anastasia Zhukova, et al.
0

Unsupervised concept identification through clustering, i.e., identification of semantically related words and phrases, is a common approach to identify contextual primitives employed in various use cases, e.g., text dimension reduction, i.e., replace words with the concepts to reduce the vocabulary size, summarization, and named entity resolution. We demonstrate the first results of an unsupervised approach for the identification of groups of persons as actors extracted from a set of related articles. Specifically, the approach clusters mentions of groups of persons that act as non-named entity actors in the texts, e.g., "migrant families" = "asylum-seekers." Compared to our baseline, the approach keeps the mentions of the geopolitical entities separated, e.g., "Iran leaders" != "European leaders," and clusters (in)directly related mentions with diverse wording, e.g., "American officials" = "Trump Administration."

READ FULL TEXT
POST COMMENT

Comments

There are no comments yet.

Authors

page 1

page 2

page 3

page 4

12/06/2017

Named Entity Sequence Classification

Named Entity Recognition (NER) aims at locating and classifying named en...
12/13/2021

ANEA: Automated (Named) Entity Annotation for German Domain-Specific Texts

Named entity recognition (NER) is an important task that aims to resolve...
09/01/2019

Pre-training of Deep Contextualized Embeddings of Words and Entities for Named Entity Disambiguation

Deep contextualized embeddings trained using unsupervised language model...
11/26/2018

Scalable graph-based individual named entity identification

Named entity discovery (NED) is an important information retrieval probl...
10/16/2018

Named Entity Analysis and Extraction with Uncommon Words

Most previous research treats named entity extraction and classification...
01/08/2018

Term Relevance Feedback for Contextual Named Entity Retrieval

We address the role of a user in Contextual Named Entity Retrieval (CNER...
05/24/2020

MASK: A flexible framework to facilitate de-identification of clinical texts

Medical health records and clinical summaries contain a vast amount of i...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.