From Small Scales to Large Scales: Distance-to-Measure Density based Geometric Analysis of Complex Data

05/16/2022
by   Katharina Proksch, et al.
0

How can we tell complex point clouds with different small scale characteristics apart, while disregarding global features? Can we find a suitable transformation of such data in a way that allows to discriminate between differences in this sense with statistical guarantees? In this paper, we consider the analysis and classification of complex point clouds as they are obtained, e.g., via single molecule localization microscopy. We focus on the task of identifying differences between noisy point clouds based on small scale characteristics, while disregarding large scale information such as overall size. We propose an approach based on a transformation of the data via the so-called Distance-to-Measure (DTM) function, a transformation which is based on the average of nearest neighbor distances. For each data set, we estimate the probability density of average local distances of all data points and use the estimated densities for classification. While the applicability is immediate and the practical performance of the proposed methodology is very good, the theoretical study of the density estimators is quite challenging, as they are based on i.i.d. observations that have been obtained via a complicated transformation. In fact, the transformed data are stochastically dependent in a non-local way that is not captured by commonly considered dependence measures. Nonetheless, we show that the asymptotic behaviour of the density estimator is driven by a kernel density estimator of certain i.i.d. random variables by using theoretical properties of U-statistics, which allows to handle the dependencies via a Hoeffding decomposition. We show via a numerical study and in an application to simulated single molecule localization microscopy data of chromatin fibers that unsupervised classification tasks based on estimated DTM-densities achieve excellent separation results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/08/2022

Real-time Rail Recognition Based on 3D Point Clouds

Accurate rail location is a crucial part in the railway support driving ...
research
11/10/2021

A K-function for inhomogeneous random measures with geometric features

This paper introduces a K-function for assessing second-order properties...
research
07/04/2023

Semantic Segmentation on 3D Point Clouds with High Density Variations

LiDAR scanning for surveying applications acquire measurements over wide...
research
09/11/2020

A Density-Aware PointRCNN for 3D Objection Detection in Point Clouds

We present an improved version of PointRCNN for 3D object detection, in ...
research
01/31/2020

Local intrinsic dimensionality estimators based on concentration of measure

Intrinsic dimensionality (ID) is one of the most fundamental characteris...
research
11/25/2019

Nearest Neighbor Sampling of Point Sets using Random Rays

We propose a new framework for the sampling, compression, and analysis o...
research
12/14/2017

Adaptive kNN using Expected Accuracy for Classification of Geo-Spatial Data

The k-Nearest Neighbor (kNN) classification approach is conceptually sim...

Please sign up or login with your details

Forgot password? Click here to reset