
Hierarchical correlation reconstruction with missing data, for example for biologyinspired neuron
Machine learning often needs to estimate density from a multidimensional...
read it

Credibility evaluation of income data with hierarchical correlation reconstruction
In situations like tax declarations or analyzes of household budgets we ...
read it

Handling missing data in modelbased clustering
Gaussian Mixture models (GMMs) are a powerful tool for clustering, class...
read it

Multivariate Density Estimation with Missing Data
Multivariate density estimation is a popular technique in statistics wit...
read it

A random model for multidimensional fitting method
Multidimensional fitting (MDF) method is a multivariate data analysis me...
read it

Missing Data Estimation in HighDimensional Datasets: A Swarm IntelligenceDeep Neural Network Approach
In this paper, we examine the problem of missing data in highdimensiona...
read it

Reconstruction of sequential data with density models
We introduce the problem of reconstructing a sequence of multidimensiona...
read it
Hierarchical correlation reconstruction with missing data
Machine learning often needs to estimate density from a multidimensional data sample, where we would also like to model correlations between coordinates. Additionally, we often have missing data case: that data points have only partial information  can miss information about some coordinates. This paper adapts rapid parametric density estimation technique for this purpose: modelling density as a linear combination, for which L^2 optimization says that estimated coefficient for a given function is just average over the sample of this function. Hierarchical correlation reconstruction first models probability density for each separate coordinate using all its appearances in data sample, then adds corrections from independently modelled pairwise correlations using all samples having both coordinates, and so on independently adding correlations for growing numbers of variables using decreasing evidence in our data sample. A basic application of such modelled multidimensional density can be imputation of missing coordinates: by inserting known coordinates to the density, and taking expected values for the missing coordinates, and maybe also variance to estimate their uncertainty.
READ FULL TEXT
Comments
There are no comments yet.