Evaluating semantic models with word-sentence relatedness

03/23/2016
by   Kimberly Glasgow, et al.
0

Semantic textual similarity (STS) systems are designed to encode and evaluate the semantic similarity between words, phrases, sentences, and documents. One method for assessing the quality or authenticity of semantic information encoded in these systems is by comparison with human judgments. A data set for evaluating semantic models was developed consisting of 775 English word-sentence pairs, each annotated for semantic relatedness by human raters engaged in a Maximum Difference Scaling (MDS) task, as well as a faster alternative task. As a sample application of this relatedness data, behavior-based relatedness was compared to the relatedness computed via four off-the-shelf STS models: n-gram, Latent Semantic Analysis (LSA), Word2Vec, and UMBC Ebiquity. Some STS models captured much of the variance in the human judgments collected, but they were not sensitive to the implicatures and entailments that were processed and considered by the participants. All text stimuli and judgment data have been made freely available.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/15/2018

Introducing two Vietnamese Datasets for Evaluating Semantic Models of (Dis-)Similarity and Relatedness

We present two novel datasets for the low-resource language Vietnamese t...
research
02/17/2016

A Comprehensive Comparative Study of Word and Sentence Similarity Measures

Sentence similarity is considered the basis of many natural language tas...
research
09/17/2019

Semantic Relatedness Based Re-ranker for Text Spotting

Applications such as textual entailment, plagiarism detection or documen...
research
08/19/2017

ClaC: Semantic Relatedness of Words and Phrases

The measurement of phrasal semantic relatedness is an important metric f...
research
10/17/2020

Incorporate Semantic Structures into Machine Translation Evaluation via UCCA

Copying mechanism has been commonly used in neural paraphrasing networks...
research
04/30/2019

Model Comparison for Semantic Grouping

We introduce a probabilistic framework for quantifying the semantic simi...
research
05/29/2020

Harbsafe-162. A Domain-Specific Data Set for the Intrinsic Evaluation of Semantic Representations for Terminological Data

The article presents Harbsafe-162, a domain-specific data set for evalua...

Please sign up or login with your details

Forgot password? Click here to reset