A Measure of Similarity in Textual Data Using Spearman's Rank Correlation Coefficient

11/26/2019
by   Nino Arsov, et al.
0

In the last decade, many diverse advances have occurred in the field of information extraction from data. Information extraction in its simplest form takes place in computing environments, where structured data can be extracted through a series of queries. The continuous expansion of quantities of data have therefore provided an opportunity for knowledge extraction (KE) from a textual document (TD). A typical problem of this kind is the extraction of common characteristics and knowledge from a group of TDs, with the possibility to group such similar TDs in a process known as clustering. In this paper we present a technique for such KE among a group of TDs related to the common characteristics and meaning of their content. Our technique is based on the Spearman's Rank Correlation Coefficient (SRCC), for which the conducted experiments have proven to be comprehensive measure to achieve a high-quality KE.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/19/2019

Correlation Coefficients and Semantic Textual Similarity

A large body of research into semantic textual similarity has focused on...
research
07/15/2021

Auto-detecting groups based on textual similarity for group recommendations

In general, recommender systems are designed to provide personalized ite...
research
05/09/2013

A Rank Minrelation - Majrelation Coefficient

Improving the detection of relevant variables using a new bivariate meas...
research
06/05/2012

Possibilistic Pertinence Feedback and Semantic Networks for Goal's Extraction

Pertinence Feedback is a technique that enables a user to interactively ...
research
01/14/2018

DCDistance: A Supervised Text Document Feature extraction based on class labels

Text Mining is a field that aims at extracting information from textual ...
research
01/13/2022

PageRank Algorithm using Eigenvector Centrality – New Approach

The purpose of the research is to find a centrality measure that can be ...
research
05/13/2019

A Review of Keyphrase Extraction

Automated keyphrase extraction is a crucial textual information processi...

Please sign up or login with your details

Forgot password? Click here to reset