Occam's Ghost

06/15/2020
by   Peter Kovesarki, et al.
0

This article applies the principle of Occam's Razor to non-parametric model building of statistical data, by finding a model with the minimal number of bits, leading to an exceptionally effective regularization method for probability density estimators. The idea comes from the fact that likelihood maximization also minimizes the number of bits required to encode a dataset. However, traditional methods overlook that the optimization of model parameters may also inadvertently play the part in encoding data points. The article shows how to extend the bit counting to the model parameters as well, providing the first true measure of complexity for parametric models. Minimizing the total bit requirement of a model of a dataset favors smaller derivatives, smoother probability density function estimates and most importantly, a phase space with fewer relevant parameters. In fact, it is able prune parameters and detect features with small probability at the same time. It is also shown, how it can be applied to any smooth, non-parametric probability density estimator.

READ FULL TEXT

page 14

page 16

page 17

research
02/28/2018

Automatic topography of high-dimensional data sets by non-parametric Density Peak clustering

Data analysis in high-dimensional spaces aims at obtaining a synthetic d...
research
06/20/2018

Non-Parametric Calibration of Probabilistic Regression

The task of calibration is to retrospectively adjust the outputs from a ...
research
10/08/2022

An Efficient and Continuous Voronoi Density Estimator

We introduce a non-parametric density estimator deemed Radial Voronoi De...
research
08/06/2014

Empirical non-parametric estimation of the Fisher Information

The Fisher information matrix (FIM) is a foundational concept in statist...
research
06/22/2012

Estimating Densities with Non-Parametric Exponential Families

We propose a novel approach for density estimation with exponential fami...
research
07/01/2021

Demystifying statistical learning based on efficient influence functions

Evaluation of treatment effects and more general estimands is typically ...
research
01/27/2020

Structural Information Learning Machinery: Learning from Observing, Associating, Optimizing, Decoding, and Abstracting

In the present paper, we propose the model of structural information le...

Please sign up or login with your details

Forgot password? Click here to reset