MESSY Estimation: Maximum-Entropy based Stochastic and Symbolic densitY Estimation

06/07/2023
by Tony Tohme, et al.

We introduce MESSY estimation, a Maximum-Entropy based Stochastic and Symbolic densitY estimation method. The proposed approach recovers probability density functions symbolically from samples using moments of a gradient flow in which the ansatz serves as the driving force. In particular, we construct a gradient-based drift-diffusion process that connects samples of the unknown distribution function to a guessed symbolic expression. We then show that when the guessed distribution has the maximum entropy form, its parameters can be found efficiently by solving a linear system of equations constructed from the moments of the provided samples. Furthermore, we use symbolic regression to explore the space of smooth functions and find basis functions for the exponent of the maximum entropy functional that lead to good conditioning. The cost of each iteration of the random search is linear in the number of samples and quadratic in the number of basis functions. We validate the proposed MESSY estimation method against other benchmark methods for a bimodal density, a discontinuous density, and a density at the limit of physical realizability. We find that adding a symbolic search for basis functions improves the accuracy of the estimation at a reasonable additional computational cost. Our results suggest that the proposed method outperforms existing density recovery methods in the limit of a small to moderate number of samples by providing a low-bias and tractable symbolic description of the unknown density.
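To make the "linear system built from sample moments" idea concrete, here is a minimal sketch for a one-dimensional density of the form exp(sum_k lambda_k H_k(x)). It is an illustration under stated assumptions, not the paper's implementation: the basis is assumed to be the monomials H_k(x) = x^k, the function name fit_maxent_polynomial is hypothetical, and the system is obtained from the standard integration-by-parts identity sum_j lambda_j E[H_i' H_j'] = -E[H_i''], which may differ in detail from the construction in the full text. Building the matrix takes work linear in the number of samples and quadratic in the number of basis functions, consistent with the cost quoted above.

```python
import numpy as np

def fit_maxent_polynomial(samples, degree=2):
    """Estimate lambda in p(x) ~ exp(sum_k lambda_k x^k), k = 1..degree.

    Hypothetical helper: assumes a monomial basis and uses the identity
    sum_j lambda_j E[H_i' H_j'] = -E[H_i''] with sample averages.
    """
    x = np.asarray(samples, dtype=float)
    ks = np.arange(1, degree + 1)

    # First derivatives H_k'(x) = k x^(k-1), shape (degree, n_samples).
    dH = ks[:, None] * x[None, :] ** (ks[:, None] - 1)

    # Second derivatives H_k''(x) = k (k-1) x^(k-2); the k = 1 row is zero,
    # so the exponent is clipped at 0 to avoid negative powers.
    d2H = (ks * (ks - 1))[:, None] * x[None, :] ** np.maximum(ks[:, None] - 2, 0)

    # Moment matrix L_ij = mean(H_i' H_j') and right-hand side b_i = -mean(H_i'').
    L = dH @ dH.T / x.size
    b = -d2H.mean(axis=1)
    return np.linalg.solve(L, b)

# Usage: samples from a Gaussian with mean 1.0 and standard deviation 0.5.
rng = np.random.default_rng(0)
samples = rng.normal(loc=1.0, scale=0.5, size=20_000)
lam = fit_maxent_polynomial(samples, degree=2)
print(lam)  # should be close to [mu / sigma^2, -1 / (2 sigma^2)]
```

For a Gaussian the degree-2 fit reduces to the sample mean and variance, which gives a quick sanity check. With higher-degree monomials the moment matrix tends to become ill-conditioned, which is one reason a search over better-conditioned basis functions is attractive.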


