Using Eigencentrality to Estimate Joint, Conditional and Marginal Probabilities from Mixed-Variable Data: Method and Applications

09/19/2018
by   Andrew Skabar, et al.
6

The ability to estimate joint, conditional and marginal probability distributions over some set of variables is of great utility for many common machine learning tasks. However, estimating these distributions can be challenging, particularly in the case of data containing a mix of discrete and continuous variables. This paper presents a non-parametric method for estimating these distributions directly from a dataset. The data are first represented as a graph consisting of object nodes and attribute value nodes. Depending on the distribution to be estimated, an appropriate eigenvector equation is then constructed. This equation is then solved to find the corresponding stationary distribution of the graph, from which the required distributions can then be estimated and sampled from. The paper demonstrates how the method can be applied to many common machine learning tasks including classification, regression, missing value imputation, outlier detection, random vector generation, and clustering.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/27/2020

A copula transformation in multivariate mixed discrete-continuous models

Copulas allow a flexible and simultaneous modeling of complicated depend...
research
01/10/2013

Sufficiency, Separability and Temporal Probabilistic Models

Suppose we are given the conditional probability of one variable given s...
research
06/27/2012

Identifying the Relevant Nodes Without Learning the Model

We propose a method to identify all the nodes that are relevant to compu...
research
05/07/2001

Joint and conditional estimation of tagging and parsing models

This paper compares two different ways of estimating statistical languag...
research
11/17/2021

GFlowNet Foundations

Generative Flow Networks (GFlowNets) have been introduced as a method to...
research
05/26/2023

Quantum Kernel Mixtures for Probabilistic Deep Learning

This paper presents a novel approach to probabilistic deep learning (PDL...
research
11/14/2019

Estimating differential entropy using recursive copula splitting

A method for estimating the Shannon differential entropy of multidimensi...

Please sign up or login with your details

Forgot password? Click here to reset