Probability Distribution on Full Rooted Trees

09/27/2021
by   Yuta Nakahara, et al.
0

The recursive and hierarchical structure of full rooted trees is applicable to represent statistical models in various areas, such as data compression, image processing, and machine learning. In most of these cases, the full rooted tree is not a random variable; as such, model selection to avoid overfitting becomes problematic. A method to solve this problem is to assume a prior distribution on the full rooted trees. This enables overfitting to be avoided based on the Bayes decision theory. For example, by assigning a low prior probability to a complex model, the maximum a posteriori estimator prevents overfitting. Furthermore, overfitting can be avoided by averaging all the models weighted by their posteriors. In this paper, we propose a probability distribution on a set of full rooted trees. Its parametric representation is suitable for calculating the properties of our distribution using recursive functions, such as the mode, expectation, and posterior distribution. Although such distributions have been proposed in previous studies, they are only applicable to specific applications. Therefore, we extract their mathematically essential components and derive new generalized methods to calculate the expectation, posterior distribution, etc.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/24/2022

Probability Distribution on Rooted Trees

The hierarchical and recursive expressive capability of rooted trees is ...
research
06/12/2023

Prediction Algorithms Achieving Bayesian Decision Theoretical Optimality Based on Decision Trees as Data Observation Processes

In the field of decision trees, most previous studies have difficulty en...
research
03/17/2023

Batch Updating of a Posterior Tree Distribution over a Meta-Tree

Previously, we proposed a probabilistic data generation model represente...
research
07/26/2021

From robust tests to Bayes-like posterior distributions

In the Bayes paradigm and for a given loss function, we propose the cons...
research
11/29/2017

Objective Bayesian inference with proper scoring rules

Standard Bayesian analyses can be difficult to perform when the full lik...
research
12/08/2019

Contrast Trees and Distribution Boosting

Often machine learning methods are applied and results reported in cases...
research
11/14/2022

Scalable Model Selection for Staged Trees: Mean-posterior Clustering and Binary Trees

Several structure-learning algorithms for staged trees, asymmetric exten...

Please sign up or login with your details

Forgot password? Click here to reset