The Historical Significance of Textual Distances

06/30/2018
by   Ted Underwood, et al.
0

Measuring similarity is a basic task in information retrieval, and now often a building-block for more complex arguments about cultural change. But do measures of textual similarity and distance really correspond to evidence about cultural proximity and differentiation? To explore that question empirically, this paper compares textual and social measures of the similarities between genres of English-language fiction. Existing measures of textual similarity (cosine similarity on tf-idf vectors or topic vectors) are also compared to new strategies that use supervised learning to anchor textual measurement in a social context.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/19/2019

Correlation Coefficients and Semantic Textual Similarity

A large body of research into semantic textual similarity has focused on...
research
09/11/2018

Evaluating Multimodal Representations on Sentence Similarity: vSTS, Visual Semantic Textual Similarity Dataset

In this paper we introduce vSTS, a new dataset for measuring textual sim...
research
04/05/2017

CompiLIG at SemEval-2017 Task 1: Cross-Language Plagiarism Detection Methods for Semantic Textual Similarity

We present our submitted systems for Semantic Textual Similarity (STS) T...
research
05/24/2023

CSTS: Conditional Semantic Textual Similarity

Semantic textual similarity (STS) has been a cornerstone task in NLP tha...
research
07/08/2021

A Triangle Inequality for Cosine Similarity

Similarity search is a fundamental problem for many data analysis techni...
research
11/01/2019

Finding the most similar textual documents using Case-Based Reasoning

In recent years, huge amounts of unstructured textual data on the Intern...
research
06/11/2012

Dimension Independent Similarity Computation

We present a suite of algorithms for Dimension Independent Similarity Co...

Please sign up or login with your details

Forgot password? Click here to reset