Measuring spatial uniformity with the hypersphere chord length distribution

04/12/2020
by   Panagiotis Sidiropoulos, et al.
0

Data uniformity is a concept associated with several semantic data characteristics such as lack of features, correlation and sample bias. This article introduces a novel measure to assess data uniformity and detect uniform pointsets on high-dimensional Euclidean spaces. Spatial uniformity measure builds upon the isomorphism between hyperspherical chords and L2-normalised data Euclidean distances, which is implied by the fact that, in Euclidean spaces, L2-normalised data can be geometrically defined as points on a hypersphere. The imposed connection between the distance distribution of uniformly selected points and the hyperspherical chord length distribution is employed to quantify uniformity. More specifically,, the closed-form expression of hypersphere chord length distribution is revisited extended, before examining a few qualitative and quantitative characteristics of this distribution that can be rather straightforwardly linked to data uniformity. The experimental section includes validation in four distinct setups, thus substantiating the potential of the new uniformity measure on practical data-science applications.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/04/2023

Existence and approximation of densities of chord length- and cross section area distributions

In various stereological problems an n-dimensional convex body is inters...
research
06/16/2023

Spatial depth for data in metric spaces

We propose a novel measure of statistical depth, the metric spatial dept...
research
07/15/2021

Nonparametric Statistical Inference via Metric Distribution Function in Metric Spaces

Distribution function is essential in statistical inference, and connect...
research
09/09/2021

Online Search for a Hyperplane in High-Dimensional Euclidean Space

We consider the online search problem in which a server starting at the ...
research
11/20/2014

N-sphere chord length distribution

This work studies the chord length distribution, in the case where both ...
research
07/11/2023

Measure transfer via stochastic slicing and matching

This paper studies iterative schemes for measure transfer and approximat...
research
10/29/2019

Meta Distribution of SIR in Ultra-Dense Networks with Bipartite Euclidean Matchings

Ultra-dense networks maximise spatial spectral efficiency through spatia...

Please sign up or login with your details

Forgot password? Click here to reset