Unbalanced Optimal Transport for Unbalanced Word Alignment

06/07/2023
by Yuki Arase, et al.

Monolingual word alignment is crucial for modeling semantic interactions between sentences. In particular, null alignment, in which a word has no counterpart in the other sentence, is pervasive and critical for handling semantically divergent sentences. Identifying null alignment is also useful on its own for reasoning about the semantic similarity of sentences, because it indicates an inequality of information between them. To achieve unbalanced word alignment that values both alignment and null alignment, this study shows that the optimal transport (OT) family, i.e., balanced, partial, and unbalanced OT, provides natural and powerful approaches even without tailor-made techniques. Our extensive experiments, covering unsupervised and supervised settings, indicate that our generic OT-based alignment methods are competitive with state-of-the-art methods designed specifically for word alignment, particularly on challenging datasets with high null alignment frequencies.
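To make the idea of unbalanced OT with null alignment concrete, here is a minimal sketch and not the authors' implementation. It assumes a cosine-distance cost between word embeddings, uniform word masses, entropic regularization with KL-relaxed marginals solved by Sinkhorn-style scaling iterations, and a fixed threshold on the transport plan. The function names and the values of `eps`, `tau`, and `threshold` are hypothetical choices for illustration only.

```python
import numpy as np

def cosine_cost(src, tgt):
    """Cosine distance between every source and target word embedding."""
    src = src / np.linalg.norm(src, axis=1, keepdims=True)
    tgt = tgt / np.linalg.norm(tgt, axis=1, keepdims=True)
    return 1.0 - src @ tgt.T

def unbalanced_sinkhorn(cost, a, b, eps=0.1, tau=0.1, n_iter=200):
    """Entropic unbalanced OT: marginals are only softly enforced (KL penalty
    with weight tau), so mass of hard-to-match words can shrink toward zero."""
    K = np.exp(-cost / eps)          # Gibbs kernel
    rho = tau / (tau + eps)          # damping exponent from the KL relaxation
    u = np.ones_like(a)
    v = np.ones_like(b)
    for _ in range(n_iter):
        u = (a / (K @ v)) ** rho
        v = (b / (K.T @ u)) ** rho
    return u[:, None] * K * v[None, :]   # transport plan

def align(src, tgt, threshold=0.1, **kwargs):
    """Return aligned index pairs plus null-aligned source and target words.
    The absolute threshold is an assumption made for this sketch."""
    n, m = len(src), len(tgt)
    plan = unbalanced_sinkhorn(
        cosine_cost(src, tgt), np.full(n, 1.0 / n), np.full(m, 1.0 / m), **kwargs
    )
    keep = plan > threshold
    pairs = [(int(i), int(j)) for i, j in zip(*np.nonzero(keep))]
    null_src = [i for i in range(n) if not keep[i].any()]
    null_tgt = [j for j in range(m) if not keep[:, j].any()]
    return pairs, null_src, null_tgt

# Toy example: three source words, two target words; the third source word
# has no similar counterpart and should come out as a null alignment.
src_emb = np.eye(3)
tgt_emb = np.eye(3)[:2]
pairs, null_src, null_tgt = align(src_emb, tgt_emb)
```

In this toy setting, the relaxed marginal constraints let the plan drop almost all mass of the unmatched source word instead of forcing it onto a dissimilar target, which is the behavior that balanced OT cannot express directly.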


