Optimising HEP parameter fits via Monte Carlo weight derivative regression

03/28/2020
by Andrea Valassi, et al.

HEP event selection is traditionally considered a binary classification problem, involving the dichotomous categories of signal and background. In distribution fits for particle masses or couplings, however, signal events are not all equivalent, as the signal differential cross section has different sensitivities to the measured parameter in different regions of phase space. In this paper, I describe a mathematical framework for the evaluation and optimization of HEP parameter fits, where this sensitivity is defined on an event-by-event basis, and for MC events it is modeled in terms of their MC weight derivatives with respect to the measured parameter. Minimising the statistical error on a measurement implies the need to resolve (i.e. separate) events with different sensitivities, which ultimately represents a non-dichotomous classification problem. Since MC weight derivatives are not available for real data, the practical strategy I suggest consists in training a regressor of weight derivatives against MC events, and then using it as an optimal partitioning variable for 1-dimensional fits of data events. This CHEP2019 paper is an extension of the study presented at CHEP2018: in particular, event-by-event sensitivities allow the exact computation of the "FIP" ratio between the Fisher information obtained from an analysis and the maximum information that could possibly be obtained with an ideal detector. Using this expression, I discuss the relationship between FIP and two metrics commonly used in Meteorology (Brier score and MSE), and the importance of "sharpness" both in HEP and in that domain. I finally point out that HEP distribution fits should be optimized and evaluated using probabilistic metrics (like FIP or MSE), whereas ranking metrics (like AUC) or threshold metrics (like accuracy) are of limited relevance for these specific problems.
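The strategy described in the abstract can be illustrated with a small toy sketch. It is not the paper's implementation: the toy event model, the polynomial regressor, the equal-population binning and all function names are assumptions made for illustration. The sketch also assumes that the per-event sensitivity is the normalised weight derivative `gamma_i = (1/w_i) dw_i/dtheta`, that a binned fit carries information `sum_k (sum_{i in k} w_i gamma_i)^2 / sum_{i in k} w_i`, and that the ideal-detector limit is `sum_i w_i gamma_i^2`, so that FIP is the ratio of the two.

```python
import numpy as np


def fisher_information_binned(weights, gammas, bin_index, n_bins):
    """Information of a binned fit: sum over bins of (dn_k/dtheta)^2 / n_k."""
    n_k = np.bincount(bin_index, weights=weights, minlength=n_bins)
    dn_k = np.bincount(bin_index, weights=weights * gammas, minlength=n_bins)
    filled = n_k > 0  # skip empty bins to avoid dividing by zero
    return np.sum(dn_k[filled] ** 2 / n_k[filled])


def fisher_information_ideal(weights, gammas):
    """Assumed ideal limit: every event resolved by its own sensitivity."""
    return np.sum(weights * gammas ** 2)


def fip(weights, gammas, bin_index, n_bins):
    """Ratio of the binned information to the assumed ideal information."""
    binned = fisher_information_binned(weights, gammas, bin_index, n_bins)
    return binned / fisher_information_ideal(weights, gammas)


# Toy MC sample (entirely hypothetical): one observed feature x, unit
# weights, and a sensitivity that depends on x plus an unobserved part.
rng = np.random.default_rng(0)
n_events = 20000
x = rng.uniform(-1.0, 1.0, n_events)
weights = np.ones(n_events)
gammas = x + rng.normal(0.0, 0.3, n_events)

# Stand-in regressor: weighted least-squares cubic polynomial of gamma on x.
# np.polyfit multiplies residuals by w, hence the square root of the weights.
coeffs = np.polyfit(x, gammas, deg=3, w=np.sqrt(weights))
predicted = np.polyval(coeffs, x)

# Partition events into equal-population bins of the regressor output.
n_bins = 20
edges = np.quantile(predicted, np.linspace(0.0, 1.0, n_bins + 1))
bin_index = np.clip(
    np.searchsorted(edges, predicted, side="right") - 1, 0, n_bins - 1
)

fip_regressor = fip(weights, gammas, bin_index, n_bins)
# Reference: a single bin, i.e. a pure counting measurement.
fip_single = fip(weights, gammas, np.zeros(n_events, dtype=int), 1)

print(f"FIP, binned in regressor output: {fip_regressor:.3f}")
print(f"FIP, single bin:                 {fip_single:.3f}")
```

For real data the true derivatives are unavailable, so only `predicted` (the regressor output evaluated on data features) would be used to fill the bins of the 1-dimensional fit; the MC sample supplies the per-bin yields and their derivatives. In this toy the FIP stays below 1 because part of the sensitivity is unobserved and cannot be resolved by any function of `x`.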


research
12/01/2022

GAN-MC: a Variance Reduction Tool for Derivatives Pricing

We propose a parameter-free model for estimating the price or valuation ...
research
05/31/2021

The use of Generative Adversarial Networks to characterise new physics in multi-lepton final states at the LHC

Semi-supervision in Machine Learning can be used in searches for new phy...
research
01/20/2020

A Monte Carlo EM Algorithm for the Parameter Estimation of Aggregated Hawkes Processes

A key difficulty that arises from real event data is imprecision in the ...
research
08/27/2021

A Parameter Estimation Method for Multivariate Aggregated Hawkes Processes

It is often assumed that events cannot occur simultaneously when modelli...
research
03/09/2021

BROOD: Bilevel and Robust Optimization and Outlier Detection for Efficient Tuning of High-Energy Physics Event Generators

The parameters in Monte Carlo (MC) event generators are tuned on experim...
research
07/25/2021

A Survey of Monte Carlo Methods for Parameter Estimation

Statistical signal processing applications usually require the estimatio...
research
04/02/2023

SoftED: Metrics for Soft Evaluation of Time Series Event Detection

Time series event detection methods are evaluated mainly by standard cla...
