Mathematical Foundations of Data Cohesion

08/01/2023
by   Katherine E. Moore, et al.
0

Data cohesion, a recently introduced measure inspired by social interactions, uses distance comparisons to assess relative proximity. In this work, we provide a collection of results which can guide the development of cohesion-based methods in exploratory data analysis and human-aided computation. Here, we observe the important role of highly clustered "point-like" sets and the ways in which cohesion allows such sets to take on qualities of a single weighted point. In doing so, we see how cohesion complements metric-adjacent measures of dissimilarity and responds to local density. We conclude by proving that cohesion is the unique function with (i) average value equal to one-half and (ii) the property that the influence of an outlier is proportional to its mass. Properties of cohesion are illustrated with examples throughout.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/07/2023

Towards High-Performance Exploratory Data Analysis (EDA) Via Stable Equilibrium Point

Exploratory data analysis (EDA) is a vital procedure for data science pr...
research
09/21/2016

On Data-Independent Properties for Density-Based Dissimilarity Measures in Hybrid Clustering

Hybrid clustering combines partitional and hierarchical clustering for c...
research
12/15/2010

Descriptive-complexity based distance for fuzzy sets

A new distance function dist(A,B) for fuzzy sets A and B is introduced. ...
research
11/25/2014

Similarity- based approach for outlier detection

This paper presents a new approach for detecting outliers by introducing...
research
12/13/2020

k-Variance: A Clustered Notion of Variance

We introduce k-variance, a generalization of variance built on the machi...
research
02/01/2021

Splinets – splines through the Taylor expansion, their support sets and orthogonal bases

A new representation of splines that targets efficiency in the analysis ...

Please sign up or login with your details

Forgot password? Click here to reset