Reweighting with Boosted Decision Trees

08/20/2016
by   A. Rogozhnikov, et al.
0

Machine learning tools are commonly used in modern high energy physics (HEP) experiments. Different models, such as boosted decision trees (BDT) and artificial neural networks (ANN), are widely used in analyses and even in the software triggers. In most cases, these are classification models used to select the "signal" events from data. Monte Carlo simulated events typically take part in training of these models. While the results of the simulation are expected to be close to real data, in practical cases there is notable disagreement between simulated and observed data. In order to use available simulation in training, corrections must be introduced to generated data. One common approach is reweighting - assigning weights to the simulated events. We present a novel method of event reweighting based on boosted decision trees. The problem of checking the quality of reweighting step in analyses is also discussed.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/28/2021

SONG: Self-Organizing Neural Graphs

Recent years have seen a surge in research on deep interpretable neural ...
research
12/20/2014

Using Neural Networks for Click Prediction of Sponsored Search

Sponsored search is a multi-billion dollar industry and makes up a major...
research
06/02/2022

NeuralSympCheck: A Symptom Checking and Disease Diagnostic Neural Model with Logic Regularization

The symptom checking systems inquire users for their symptoms and perfor...
research
01/16/2020

Extracting more from boosted decision trees: A high energy physics case study

Particle identification is one of the core tasks in the data analysis pi...
research
10/19/2017

Decision Trees for Helpdesk Advisor Graphs

We use decision trees to build a helpdesk agent reference network to fac...
research
06/19/2022

Generational Differences in Automobility: Comparing America's Millennials and Gen Xers Using Gradient Boosting Decision Trees

Whether the Millennials are less auto-centric than the previous generati...
research
06/29/2020

Handling Missing Data in Decision Trees: A Probabilistic Approach

Decision trees are a popular family of models due to their attractive pr...

Please sign up or login with your details

Forgot password? Click here to reset