Measuring the Measuring Tools: An Automatic Evaluation of Semantic Metrics for Text Corpora

11/29/2022
by   George Kour, et al.
0

The ability to compare the semantic similarity between text corpora is important in a variety of natural language processing applications. However, standard methods for evaluating these metrics have yet to be established. We propose a set of automatic and interpretable measures for assessing the characteristics of corpus-level semantic similarity metrics, allowing sensible comparison of their behavior. We demonstrate the effectiveness of our evaluation measures in capturing fundamental characteristics by evaluating them on a collection of classical and state-of-the-art metrics. Our measures revealed that recently-developed metrics are becoming better in identifying semantic distributional mismatch while classical metrics are more sensitive to perturbations in the surface text levels.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/19/2017

ClaC: Semantic Relatedness of Words and Phrases

The measurement of phrasal semantic relatedness is an important metric f...
research
07/02/2022

FRAME: Evaluating Simulatability Metrics for Free-Text Rationales

Free-text rationales aim to explain neural language model (LM) behavior ...
research
10/06/2019

Measuring Sentences Similarity: A Survey

This study is to review the approaches used for measuring sentences simi...
research
12/11/2019

CoSimLex: A Resource for Evaluating Graded Word Similarity in Context

State of the art natural language processing tools are built on context-...
research
03/19/2020

Diversity, Density, and Homogeneity: Quantitative Characteristic Metrics for Text Collections

Summarizing data samples by quantitative measures has a long history, wi...
research
10/30/2020

Semantic similarity-based approach to enhance supervised classification learning accuracy

This brief communication discusses the usefulness of semantic similarity...
research
08/24/2019

DAST Model: Deciding About Semantic Complexity of a Text

Measuring of text complexity is a needed task in several domains and app...

Please sign up or login with your details

Forgot password? Click here to reset