Multi-Relational Hyperbolic Word Embeddings from Natural Language Definitions

05/12/2023
by   Marco Valentino, et al.
11

Neural-based word embeddings using solely distributional information have consistently produced useful meaning representations for downstream tasks. However, existing approaches often result in representations that are hard to interpret and control. Natural language definitions, on the other side, possess a recursive, self-explanatory semantic structure that can support novel representation learning paradigms able to preserve explicit conceptual relations and constraints in the vector space. This paper proposes a neuro-symbolic, multi-relational framework to learn word embeddings exclusively from natural language definitions by jointly mapping defined and defining terms along with their corresponding semantic relations. By automatically extracting the relations from definitions corpora and formalising the learning problem via a translational objective, we specialise the framework in hyperbolic space to capture the hierarchical and multi-resolution structure induced by the definitions. An extensive empirical analysis demonstrates that the framework can help impose the desired structural constraints while preserving the mapping required for controllable and interpretable semantic navigation. Moreover, the experiments reveal the superiority of the hyperbolic word embeddings over the euclidean counterparts and demonstrate that the multi-relational framework can obtain competitive results when compared to state-of-the-art neural approaches (including Transformers), with the advantage of being significantly more efficient and intrinsically interpretable.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/30/2018

Skip-gram word embeddings in hyperbolic space

Embeddings of tree-like graphs in hyperbolic space were recently shown t...
research
07/11/2016

Mapping distributional to model-theoretic semantic spaces: a baseline

Word embeddings have been shown to be useful across state-of-the-art sys...
research
09/07/2021

ArGoT: A Glossary of Terms extracted from the arXiv

We introduce ArGoT, a data set of mathematical terms extracted from the ...
research
05/27/2022

Semeval-2022 Task 1: CODWOE – Comparing Dictionaries and Word Embeddings

Word embeddings have advanced the state of the art in NLP across numerou...
research
11/23/2017

SPINE: SParse Interpretable Neural Embeddings

Prediction without justification has limited utility. Much of the succes...
research
04/26/2022

From Hyperbolic Geometry Back to Word Embeddings

We choose random points in the hyperbolic disc and claim that these poin...
research
09/10/2019

Definition Frames: Using Definitions for Hybrid Concept Representations

Concept representations is a particularly active area in NLP. Although r...

Please sign up or login with your details

Forgot password? Click here to reset