Hierarchical correlation reconstruction with missing data, for example for biology-inspired neuron

04/17/2018
by Jarek Duda, et al.

Machine learning often needs to estimate a density from a multidimensional data sample, including modelling the correlations between coordinates. We also often face the missing data case, in which data points carry only partial information: the values of some coordinates are unknown. This article adapts a rapid parametric density estimation technique for this purpose. It models the density as a linear combination of orthonormal functions, for which L^2 optimization says that the coefficient of a given function can be estimated independently, as simply the average of this function's value over the sample. Hierarchical correlation reconstruction first models the probability density of each separate coordinate using all its appearances in the data sample, then adds corrections from independently modelled pairwise correlations using all samples containing both coordinates, and so on, independently adding correlations for growing numbers of variables while using decreasing evidence from the data sample. A basic application of such a modelled multidimensional density is imputation of missing coordinates: inserting the known coordinates into the density and taking expected values for the missing coordinates, possibly together with the variance to estimate their uncertainty. Biological neurons are seen as able to model and predict signals; the simplicity and flexibility of the presented approach make it well suited for such a biology-inspired artificial neuron.
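
The estimation and imputation steps described in the abstract can be illustrated with a minimal sketch. It rests on several assumptions that are not stated in the abstract: coordinates are normalized to [0, 1], missing values are marked with NaN, and the orthonormal family is taken to be rescaled Legendre polynomials. The function names (`basis`, `fit_coefficients`, `impute_one`) and the parameters `m` and `max_order` are hypothetical and chosen only for illustration. Each coefficient is the average of a product of basis functions over the rows in which all involved coordinates are known, so single-coordinate terms use every appearance of that coordinate while pairwise terms use only rows containing both.

```python
import itertools
import numpy as np
from numpy.polynomial.legendre import legvander

def basis(x, m):
    """Orthonormal polynomials f_0..f_m on [0, 1] (rescaled Legendre, an assumed choice).
    Returns an array of shape x.shape + (m + 1,)."""
    x = np.asarray(x, dtype=float)
    return legvander(2.0 * x - 1.0, m) * np.sqrt(2.0 * np.arange(m + 1) + 1.0)

def fit_coefficients(data, m=2, max_order=2):
    """data: (n, d) array in [0, 1] with NaN for missing entries.
    Returns {multi-index j: a_j}, each a_j an average over the rows that
    contain every coordinate with j_c > 0."""
    data = np.asarray(data, dtype=float)
    n, d = data.shape
    known = ~np.isnan(data)
    # Placeholder 0.5 for missing entries; those entries are masked out below.
    f = basis(np.where(known, data, 0.5), m)  # shape (n, d, m + 1)
    coeffs = {}
    for j in itertools.product(range(m + 1), repeat=d):
        used = [c for c in range(d) if j[c] > 0]
        if len(used) > max_order:
            continue  # skip correlations of more variables than requested
        if not used:
            coeffs[j] = 1.0  # normalization term
            continue
        rows = known[:, used].all(axis=1)
        prod = np.ones(int(rows.sum()))
        for c in used:
            prod *= f[rows, c, j[c]]
        coeffs[j] = float(prod.mean()) if prod.size else 0.0
    return coeffs

def impute_one(coeffs, point, m=2):
    """Expected value of the single NaN coordinate of `point`, given the others."""
    point = np.asarray(point, dtype=float)
    missing = np.isnan(point)
    (t,) = np.flatnonzero(missing)  # exactly one missing coordinate is assumed
    f = basis(np.where(missing, 0.5, point), m)  # shape (d, m + 1)
    # On [0, 1]: the integral of f_k is 1 for k = 0 and 0 otherwise;
    # the integral of y * f_k(y) is 1/2, 1/(2*sqrt(3)), 0, 0, ...
    mom0 = np.zeros(m + 1)
    mom0[0] = 1.0
    mom1 = np.zeros(m + 1)
    mom1[0] = 0.5
    if m >= 1:
        mom1[1] = 0.5 / np.sqrt(3.0)
    num = den = 0.0
    for j, a in coeffs.items():
        w = a * np.prod([f[c, j[c]] for c in range(len(j)) if c != t])
        den += w * mom0[j[t]]
        num += w * mom1[j[t]]
    return num / den

# Synthetic demo: y depends linearly on x, so the true E[y | x] is 0.5 * x + 0.25.
rng = np.random.default_rng(0)
x = rng.random(20000)
y = 0.5 * x + 0.5 * rng.random(20000)
sample = np.column_stack([x, y])
sample[rng.random(sample.shape) < 0.3] = np.nan  # hide about 30% of the entries
coeffs = fit_coefficients(sample)
print(impute_one(coeffs, [0.8, np.nan]))
```

A density modelled this way is not guaranteed to stay positive, so the ratio in `impute_one` can become unreliable where the modelled marginal is close to zero; the sketch does not guard against that case.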

research
12/19/2018

Credibility evaluation of income data with hierarchical correlation reconstruction

In situations like tax declarations or analyzes of household budgets we ...
research
06/04/2020

Handling missing data in model-based clustering

Gaussian Mixture models (GMMs) are a powerful tool for clustering, class...
research
05/28/2022

Angle-Uniform Parallel Coordinates

We present angle-uniform parallel coordinates, a data-independent techni...
research
08/14/2018

Multivariate Density Estimation with Missing Data

Multivariate density estimation is a popular technique in statistics wit...
research
01/19/2023

The Lost Art of Mathematical Modelling

We provide a critique of mathematical biology in light of rapid developm...
research
11/04/2019

Modelling bid-ask spread conditional distributions using hierarchical correlation reconstruction

While we would like to predict exact values, available incomplete inform...
