A Method for Estimating the Proximity of Vector Representation Groups in Multidimensional Space. On the Example of the Paraphrase Task

08/25/2019
by   A. Artemov, et al.
0

The following paper presents a method of comparing two sets of vectors. The method can be applied in all tasks, where it is necessary to measure the closeness of two objects presented as sets of vectors. It may be applicable when we compare the meanings of two sentences as part of the problem of paraphrasing. This is the problem of measuring semantic similarity of two sentences (group of words). The existing methods are not sensible for the word order or syntactic connections in the considered sentences. The method appears to be advantageous because it neither presents a group of words as one scalar value, nor does it try to show the closeness through an aggregation vector, which is mean for the set of vectors. Instead of that we measure the cosine of the angle as the mean for the first group vectors projections (the context) on one side and each vector of the second group on the other side. The similarity of two sentences defined by these means does not lose any semantic characteristics and takes account of the words traits. The method was verified on the comparison of sentence pairs in Russian.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/23/2016

Sentence Similarity Learning by Lexical Decomposition and Composition

Most conventional sentence similarity methods only focus on similar part...
research
08/27/2016

Testing APSyn against Vector Cosine on Similarity Estimation

In Distributional Semantic Models (DSMs), Vector Cosine is widely used t...
research
10/16/2018

Exploring Sentence Vector Spaces through Automatic Summarization

Given vector representations for individual words, it is necessary to co...
research
10/23/2017

Testing the limits of unsupervised learning for semantic similarity

Semantic Similarity between two sentences can be defined as a way to det...
research
07/14/2021

Efficient Approximate Search for Sets of Vectors

We consider a similarity measure between two sets A and B of vectors, th...
research
03/10/2014

Parsing using a grammar of word association vectors

This paper was was first drafted in 2001 as a formalization of the syste...
research
05/24/2018

WSD-algorithm based on new method of vector-word contexts proximity calculation via epsilon-filtration

The problem of word sense disambiguation (WSD) is considered in the arti...

Please sign up or login with your details

Forgot password? Click here to reset