CoSiNES: Contrastive Siamese Network for Entity Standardization

06/05/2023
by Jiaqing Yuan, et al.

Entity standardization maps noisy mentions from free-form text to standard entities in a knowledge base. Unlike other entity-related tasks, it must cope with mentions that lack surrounding context and vary widely in surface form, a difficulty that grows when generalizing across domains where labeled data is scarce. Previous research has mostly produced models that either rely heavily on context or are dedicated to a single domain. In contrast, we propose CoSiNES, a generic and adaptable framework with a Contrastive Siamese Network for Entity Standardization that effectively adapts a pretrained language model to capture the syntax and semantics of the entities in a new domain. We construct a new dataset in the technology domain, which contains 640 technical stack entities and 6,412 mentions collected from industrial content management systems. We demonstrate that CoSiNES yields higher accuracy and faster runtime than baselines derived from leading methods in this domain. CoSiNES also achieves competitive performance on four standard datasets from the chemistry, medicine, and biomedical domains, demonstrating its cross-domain applicability.
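
To make the idea concrete, the following is a minimal sketch of context-free entity standardization by nearest-neighbor search in an embedding space, plus the kind of margin-based objective a contrastive Siamese setup might optimize. It is not the authors' implementation. The character n-gram `embed` function is a hypothetical stand-in for the fine-tuned pretrained language model, the triplet form of the loss and its `margin` value are assumptions (the abstract only says "contrastive"), and the entity names are illustrative rather than drawn from the paper's dataset.

```python
import math
from collections import Counter


def embed(text, n=3):
    """Stand-in encoder (assumption): L2-normalised character n-gram counts.

    In a Siamese setup the same encoder, with shared weights, is applied
    to both mentions and knowledge-base entities.
    """
    padded = f" {text.lower()} "
    counts = Counter(padded[i:i + n] for i in range(len(padded) - n + 1))
    norm = math.sqrt(sum(v * v for v in counts.values()))
    return {gram: v / norm for gram, v in counts.items()}


def cosine(a, b):
    """Cosine similarity of two unit-length sparse vectors."""
    return sum(w * b.get(gram, 0.0) for gram, w in a.items())


def triplet_loss(anchor, positive, negative, margin=0.2):
    """Margin-based loss (assumed form): the anchor mention should be
    closer to its correct entity than to a wrong one by at least `margin`."""
    a = embed(anchor)
    d_pos = 1.0 - cosine(a, embed(positive))
    d_neg = 1.0 - cosine(a, embed(negative))
    return max(0.0, d_pos - d_neg + margin)


def standardize(mention, entities):
    """Map a mention to the most similar standard entity."""
    m = embed(mention)
    return max(entities, key=lambda e: cosine(m, embed(e)))


# Illustrative entity list (hypothetical, not from the paper's dataset).
entities = ["PostgreSQL", "Microsoft SQL Server", "Apache Kafka"]
print(standardize("postgres sql db", entities))  # prints "PostgreSQL"
```

In a trained system the encoder's weights would be updated to drive this loss toward zero over many (mention, correct entity, wrong entity) triplets. The fixed n-gram encoder here only illustrates the lookup and the shape of the objective.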

research
10/12/2022

Focusing on Context is NICE: Improving Overshadowed Entity Disambiguation

Entity disambiguation (ED) is the task of mapping an ambiguous entity me...
research
10/15/2021

Cross-Domain Data Integration for Named Entity Disambiguation in Biomedical Text

Named entity disambiguation (NED), which involves mapping textual mentio...
research
07/09/2018

Jointly Embedding Entities and Text with Distant Supervision

Learning representations for knowledge base entities and concepts is bec...
research
10/21/2022

SpaBERT: A Pretrained Language Model from Geographic Data for Geo-Entity Representation

Named geographic entities (geo-entities for short) are the building bloc...
research
04/16/2022

Contrastive Learning with Hard Negative Entities for Entity Set Expansion

Entity Set Expansion (ESE) is a promising task which aims to expand enti...
research
09/28/2022

Cross-Domain Neural Entity Linking

Entity Linking is the task of matching a mention to an entity in a given...
research
05/17/2019

Distant Learning for Entity Linking with Automatic Noise Detection

Accurate entity linkers have been produced for domains and languages whe...
