A Deep Learning Approach to Geographical Candidate Selection through Toponym Matching

09/17/2020
by Mariona Coll Ardanuy, et al.

Recognizing toponyms and resolving them to their real-world referents is required to provide advanced semantic access to textual data. This process is often hindered by the high degree of variation in toponyms. Candidate selection is the task of identifying the potential entities that a previously recognized toponym can refer to. Although it has traditionally received little attention in the research community, candidate selection has been shown to have a significant impact on downstream tasks (i.e., entity resolution), especially in noisy or non-standard text. In this paper, we introduce a flexible deep learning method for candidate selection through toponym matching, using state-of-the-art neural network architectures. We perform an intrinsic toponym matching evaluation based on several new realistic datasets, which cover various challenging scenarios (cross-lingual and regional variations, as well as OCR errors). We also report the method's performance on candidate selection in the context of the downstream task of toponym resolution, both on existing datasets and on a new manually annotated resource of nineteenth-century English OCR'd text.
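
To make the task concrete, the following is a minimal sketch of candidate selection through toponym matching. It is not the paper's method: a plain character-level similarity from Python's standard library stands in for the learned neural matcher, and the gazetteer, the entity identifiers, the function names, and the 0.75 threshold are all hypothetical choices made for illustration.

```python
from difflib import SequenceMatcher

# Hypothetical toy gazetteer: entity identifier -> known name variants.
GAZETTEER = {
    "loc:london": ["London", "Londres", "Londinium"],
    "loc:londonderry": ["Londonderry", "Derry"],
    "loc:lyon": ["Lyon", "Lyons"],
}


def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1]; a stand-in for a learned matcher."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def select_candidates(toponym, gazetteer, threshold=0.75):
    """Return (entity_id, best_score) pairs whose best variant meets the threshold.

    Results are sorted by descending score, so the most similar entity comes first.
    """
    scored = []
    for entity_id, variants in gazetteer.items():
        # Score the query against every known variant and keep the best one.
        best = max(similarity(toponym, variant) for variant in variants)
        if best >= threshold:
            scored.append((entity_id, best))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    # "Lonodn" imitates an OCR-style character transposition of "London".
    for query in ["London", "Lonodn", "Londres"]:
        print(query, select_candidates(query, GAZETTEER))
```

In this sketch, swapping `similarity` for a trained model that scores toponym pairs would leave the rest of the selection loop unchanged, and the selected candidates would then be passed on to a downstream resolution step.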


