Multi-Source Spatial Entity Linkage

11/20/2019
by   Suela Isaj, et al.
0

Besides the traditional cartographic data sources, spatial information can also be derived from location-based sources. However, even though different location-based sources refer to the same physical world, each one has only partial coverage of the spatial entities, describe them with different attributes, and sometimes provide contradicting information. Hence, we introduce the spatial entity linkage problem, which finds which pairs of spatial entities belong to the same physical spatial entity. Our proposed solution (QuadSky) starts with a spatial blocking technique (QuadFlex) that creates blocks of nearby spatial entities with the time complexity of the quadtree algorithm. After pairwise comparing the spatial entities in the same block, we propose the SkyRank algorithm that ranks the compared pairs using Pareto optimality. We introduce the SkyEx-* family of algorithms that can classify the pairs with 0.85 precision and 0.85 recall for a manually labeled dataset of 1,500 pairs and 0.87 precision and 0.6 recall for a semi-manually labeled dataset of 777,452 pairs. Moreover, our fully unsupervised algorithm SkyEx-D approximates the optimal result with an F-measure loss of just 0.01. Finally, QuadSky provides the best trade-off between precision and recall and the best F-measure compared to the existing baselines.

READ FULL TEXT

page 9

page 15

research
09/20/2016

An Ensemble Blocking Scheme for Entity Resolution of Large and Sparse Datasets

Entity Resolution, also called record linkage or deduplication, refers t...
research
04/22/2021

Exploiting Transitivity Constraints for Entity Matching in Knowledge Graphs

The goal of entity matching in knowledge graphs is to identify entities ...
research
11/28/2017

Classification of entities via their descriptive sentences

Hypernym identification of open-domain entities is crucial for taxonomy ...
research
10/21/2022

SpaBERT: A Pretrained Language Model from Geographic Data for Geo-Entity Representation

Named geographic entities (geo-entities for short) are the building bloc...
research
05/12/2022

Comparing Open Arabic Named Entity Recognition Tools

The main objective of this paper is to compare and evaluate the performa...
research
04/13/2020

SLIM: Scalable Linkage of Mobility Data

We present a scalable solution to link entities across mobility datasets...
research
02/01/2021

Inferring spatial relations from textual descriptions of images

Generating an image from its textual description requires both a certain...

Please sign up or login with your details

Forgot password? Click here to reset