Unsupervised Hierarchy Matching with Optimal Transport over Hyperbolic Spaces

11/06/2019
by   David Alvarez-Melis, et al.
0

This paper focuses on the problem of unsupervised alignment of hierarchical data such as ontologies or lexical databases. This is a problem that appears across areas, from natural language processing to bioinformatics, and is typically solved by appeal to outside knowledge bases and label-textual similarity. In contrast, we approach the problem from a purely geometric perspective: given only a vector-space representation of the items in the two hierarchies, we seek to infer correspondences across them. Our work derives from and interweaves hyperbolic-space representations for hierarchical data, on one hand, and unsupervised word-alignment methods, on the other. We first provide a set of negative results showing how and why Euclidean methods fail in this hyperbolic setting. We then propose a novel approach based on optimal transport over hyperbolic spaces, and show that it outperforms standard embedding alignment techniques in various experiments on cross-lingual WordNet alignment and ontology matching tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/02/2020

Aligning Hyperbolic Representations: an Optimal Transport-based approach

Hyperbolic-spaces are better suited to represent data with underlying hi...
research
05/04/2022

Cross-lingual Word Embeddings in Hyperbolic Space

Cross-lingual word embeddings can be applied to several natural language...
research
06/12/2018

Embedding Text in Hyperbolic Spaces

Natural language text exhibits hierarchical structure in a variety of re...
research
10/06/2021

Using Optimal Transport as Alignment Objective for fine-tuning Multilingual Contextualized Embeddings

Recent studies have proposed different methods to improve multilingual w...
research
08/31/2018

Gromov-Wasserstein Alignment of Word Embedding Spaces

Cross-lingual or cross-domain correspondences play key roles in tasks ra...
research
06/07/2023

Unbalanced Optimal Transport for Unbalanced Word Alignment

Monolingual word alignment is crucial to model semantic interactions bet...
research
08/14/2020

Weakly supervised cross-domain alignment with optimal transport

Cross-domain alignment between image objects and text sequences is key t...

Please sign up or login with your details

Forgot password? Click here to reset