Forest Density Estimation

01/10/2010
by   Han Liu, et al.
0

We study graph estimation and density estimation in high dimensions, using a family of density estimators based on forest structured undirected graphical models. For density estimation, we do not assume the true distribution corresponds to a forest; rather, we form kernel density estimates of the bivariate and univariate marginals, and apply Kruskal's algorithm to estimate the optimal forest on held out data. We prove an oracle inequality on the excess risk of the resulting estimator relative to the risk of the best forest. For graph estimation, we consider the problem of estimating forests with restricted tree sizes. We prove that finding a maximum weight spanning forest with restricted tree size is NP-hard, and develop an approximation algorithm for this problem. Viewing the tree size as a complexity parameter, we then select a forest using data splitting, and prove bounds on excess risk and structure selection consistency of the procedure. Experiments with simulated data and microarray data indicate that the methods are a practical alternative to Gaussian graphical models.

READ FULL TEXT
research
11/12/2015

Learning Nonparametric Forest Graphical Models with Prior Information

We present a framework for incorporating prior information into nonparam...
research
06/06/2018

Semiparametric Classification of Forest Graphical Models

We propose a new semiparametric approach to binary classification that e...
research
10/23/2020

Smoothing and adaptation of shifted Pólya Tree ensembles

Recently, S. Arlot and R. Genuer have shown that a model of random fores...
research
03/30/2019

Combining Smoothing Spline with Conditional Gaussian Graphical Model for Density and Graph Estimation

Multivariate density estimation and graphical models play important role...
research
03/18/2022

ISDE : Independence Structure Density Estimation

Density estimation appears as a subroutine in many learning procedures, ...
research
06/07/2015

Optimal Ridge Detection using Coverage Risk

We introduce the concept of coverage risk as an error measure for densit...
research
10/22/2015

Cascaded High Dimensional Histograms: A Generative Approach to Density Estimation

We present tree- and list- structured density estimation methods for hig...

Please sign up or login with your details

Forgot password? Click here to reset