Bridging Continuous and Discrete Spaces: Interpretable Sentence Representation Learning via Compositional Operations

05/24/2023
by   James Y. Huang, et al.
0

Traditional sentence embedding models encode sentences into vector representations to capture useful properties such as the semantic similarity between sentences. However, in addition to similarity, sentence semantics can also be interpreted via compositional operations such as sentence fusion or difference. It is unclear whether the compositional semantics of sentences can be directly reflected as compositional operations in the embedding space. To more effectively bridge the continuous embedding and discrete text spaces, we explore the plausibility of incorporating various compositional properties into the sentence embedding space that allows us to interpret embedding transformations as compositional sentence operations. We propose InterSent, an end-to-end framework for learning interpretable sentence embeddings that supports compositional sentence operations in the embedding space. Our method optimizes operator networks and a bottleneck encoder-decoder model to produce meaningful and interpretable sentence embeddings. Experimental results demonstrate that our method significantly improves the interpretability of sentence embeddings on four textual generation tasks over existing approaches while maintaining strong performance on traditional semantic similarity tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/02/2021

Clustering and Network Analysis for the Embedding Spaces of Sentences and Sub-Sentences

Sentence embedding methods offer a powerful approach for working with sh...
research
11/10/2019

A Bilingual Generative Transformer for Semantic Sentence Embedding

Semantic sentence embedding models encode natural language sentences int...
research
08/07/2023

Topological Interpretations of GPT-3

This is an experiential study of investigating a consistent method for d...
research
04/18/2023

D2CSE: Difference-aware Deep continuous prompts for Contrastive Sentence Embeddings

This paper describes Difference-aware Deep continuous prompt for Contras...
research
12/23/2018

Improving Context-Aware Semantic Relationships in Sparse Mobile Datasets

Traditional semantic similarity models often fail to encapsulate the ext...
research
10/05/2022

Unsupervised Sentence Textual Similarity with Compositional Phrase Semantics

Measuring Sentence Textual Similarity (STS) is a classic task that can b...
research
04/22/2018

A Study on Passage Re-ranking in Embedding based Unsupervised Semantic Search

State of the art approaches for (embedding based) unsupervised semantic ...

Please sign up or login with your details

Forgot password? Click here to reset