A Comparative Study of Sentence Embedding Models for Assessing Semantic Variation

08/08/2023
by   Deven M. Mistry, et al.
0

Analyzing the pattern of semantic variation in long real-world texts such as books or transcripts is interesting from the stylistic, cognitive, and linguistic perspectives. It is also useful for applications such as text segmentation, document summarization, and detection of semantic novelty. The recent emergence of several vector-space methods for sentence embedding has made such analysis feasible. However, this raises the issue of how consistent and meaningful the semantic representations produced by various methods are in themselves. In this paper, we compare several recent sentence embedding methods via time-series of semantic similarity between successive sentences and matrices of pairwise sentence similarity for multiple books of literature. In contrast to previous work using target tasks and curated datasets to compare sentence embedding methods, our approach provides an evaluation of the methods 'in the wild'. We find that most of the sentence embedding methods considered do infer highly correlated patterns of semantic similarity in a given document, but show interesting differences.

READ FULL TEXT

page 4

page 5

page 6

page 7

page 8

page 9

research
05/22/2023

Sentence Representations via Gaussian Embedding

Recent progress in sentence embedding, which represents the meaning of a...
research
04/25/2020

Combining Word Embeddings and N-grams for Unsupervised Document Summarization

Graph-based extractive document summarization relies on the quality of t...
research
01/16/2019

Sentence transition matrix: An efficient approach that preserves sentence semantics

Sentence embedding is a significant research topic in the field of natur...
research
09/24/2018

Text Similarity in Vector Space Models: A Comparative Study

Automatic measurement of semantic text similarity is an important task i...
research
12/04/2020

On-Device Sentence Similarity for SMS Dataset

Determining the sentence similarity between Short Message Service (SMS) ...
research
11/09/2021

MNet-Sim: A Multi-layered Semantic Similarity Network to Evaluate Sentence Similarity

Similarity is a comparative-subjective measure that varies with the doma...
research
02/22/2020

Efficient Sentence Embedding via Semantic Subspace Analysis

A novel sentence embedding method built upon semantic subspace analysis,...

Please sign up or login with your details

Forgot password? Click here to reset