Probabilistic treatment of the uncertainty from the finite size of weighted Monte Carlo data

12/04/2017
by   Thorsten Glüsenkamp, et al.
0

The finite size of Monte Carlo samples carries intrinsic uncertainty that can lead to a substantial bias in parameter estimation if it is neglected and the sample size is small. We introduce a probabilistic treatment of this problem by replacing the usual likelihood functions with novel generalized probability distributions that incorporate the finite statistics via suitable marginalization. These new PDFs are analytic, and can be used to replace the Poisson, multinomial, and sample-based unbinned likelihoods, which covers many use cases in high-energy physics. In the limit of infinite statistics, they reduce to the respective standard probability distributions. In the general case of arbitrary Monte Carlo weights, the expressions involve the fourth Lauricella function F_D, for which we find a new representation as a contour integral that allows an exact and efficient calculation. The result also entails a new expression for the probability generating function of the Dirichlet-multinomial distribution with integer parameters. We demonstrate the bias reduction of our approach with a typical toy Monte Carlo problem, estimating the normalization of a peak in a falling energy spectrum, and compare the results with previously published methods from the literature.

READ FULL TEXT
research
02/23/2019

A unified perspective on modified Poisson likelihoods for limited Monte Carlo data

Counting experiments often rely on Monte Carlo simulations for predictio...
research
02/26/2022

A Log-Gaussian Cox Process with Sequential Monte Carlo for Line Narrowing in Spectroscopy

We propose a statistical model for narrowing line shapes in spectroscopy...
research
07/03/2018

Finite Sample L_2 Bounds for Sequential Monte Carlo and Adaptive Path Selection

We prove a bound on the finite sample error of sequential Monte Carlo (S...
research
11/20/2019

Automatic Differentiable Monte Carlo: Theory and Application

Differentiable programming has emerged as a key programming paradigm emp...
research
04/09/2018

An evaluation of estimation techniques for probabilistic reachability

We evaluate numerically-precise Monte Carlo (MC), Quasi-Monte Carlo (QMC...
research
01/21/2020

MonteCarloMeasurements.jl: Nonlinear Propagation of Arbitrary Multivariate Distributions by means of Method Overloading

This manuscript outlines a software package that facilitates working wit...
research
02/04/2021

HMC, an Algorithms in Data Mining, the Functional Analysis approach

The main purpose of this paper is to facilitate the communication betwee...

Please sign up or login with your details

Forgot password? Click here to reset