Exploring Wasserstein Distance across Concept Embeddings for Ontology Matching

07/22/2022
by Yuan An, et al.

Measuring the distance between ontological elements is a fundamental component of any matching solution. String-based distance metrics, which rely on discrete symbol operations, are notorious for shallow syntactic matching. In this study, we explore the Wasserstein distance metric across ontology concept embeddings. The Wasserstein distance targets a continuous space that can incorporate linguistic, structural, and logical information. In our exploratory study, we use a pre-trained word embedding system, fastText, to embed ontology element labels. We examine the effectiveness of the Wasserstein distance for measuring similarity between (blocks of) ontologies, discovering matchings between individual elements, and refining matchings by incorporating contextual information. Our experiments with the OAEI conference track and MSE benchmarks achieve results competitive with leading systems such as AML and LogMap. The results indicate a promising trajectory for applying optimal transport and Wasserstein distance to improve embedding-based unsupervised ontology matching.
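
To make the idea of a Wasserstein distance across label embeddings concrete, here is a minimal sketch rather than the authors' implementation. It assumes two equally sized sets of labels with uniform weights, in which case an optimal transport plan can be taken to be a one-to-one assignment, and it finds that assignment by brute force. The three-dimensional vectors are hand-made stand-ins for fastText embeddings. The labels, the cosine-distance cost, and the function names are all hypothetical choices for illustration.

```python
from itertools import permutations
from math import inf, sqrt


def cosine_distance(u, v):
    """Cosine distance (1 - cosine similarity) between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def wasserstein_uniform(source, target, cost=cosine_distance):
    """Exact 1-Wasserstein distance between two equally sized point sets
    with uniform weights, plus the assignment that attains it.

    Assumption: with equal sizes and uniform weights, some optimal
    transport plan is a permutation, so all permutations are enumerated.
    This is only feasible for tiny sets.
    """
    n = len(source)
    if n == 0 or n != len(target):
        raise ValueError("point sets must be non-empty and of equal size")
    # Pairwise ground costs between source and target embeddings.
    costs = [[cost(s, t) for t in target] for s in source]
    best_total, best_perm = inf, None
    for perm in permutations(range(n)):
        total = sum(costs[i][perm[i]] for i in range(n))
        if total < best_total:
            best_total, best_perm = total, perm
    # Each point carries mass 1/n, so the distance is the mean matched cost.
    return best_total / n, list(best_perm)


# Hypothetical toy "embeddings"; a real pipeline would use fastText vectors.
source_labels = ["Author", "Paper", "Review"]
source_vecs = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
target_labels = ["Article", "Evaluation", "Writer"]
target_vecs = [[0.1, 0.9, 0.0], [0.0, 0.1, 0.9], [0.9, 0.1, 0.0]]

distance, matching = wasserstein_uniform(source_vecs, target_vecs)
for i, j in enumerate(matching):
    print(source_labels[i], "->", target_labels[j])
print("Wasserstein distance:", distance)
```

The returned distance gives a single similarity score between the two label sets, while the assignment gives candidate element-level matchings. For ontologies of realistic size, or sets of unequal size, a linear-programming or entropic (Sinkhorn) solver would be needed in place of enumeration.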


