Class Imbalance Techniques for High Energy Physics

05/01/2019
by   Christopher W. Murphy, et al.
0

A common problem in high energy physics is extracting a signal from a much larger background. Posed as a classification task, there is said to be an imbalance in the number of samples belonging to the signal class versus the number of samples from the background class. Techniques for learning from imbalanced data are well established in the machine learning community. In this work we provide a brief overview of class imbalance techniques in a high energy physics setting. Two case studies are presented: (1) the measurement of the longitudinal polarization fraction in same-sign WW scattering, and (2) the decay of the Higgs boson to charm-quark pairs. We find a significant improvement in the performance of the machine learning models used in the longitudinal WW study, while no significant improvement in performance is found in the deep learning models tested. Our charm-quark tagger gives a 14 improvement in the background rejection rate.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/01/2023

CAISA at SemEval-2023 Task 8: Counterfactual Data Augmentation for Mitigating Class Imbalance in Causal Claim Identification

The class imbalance problem can cause machine learning models to produce...
research
09/01/2021

An Empirical Study on the Joint Impact of Feature Selection and Data Resampling on Imbalance Classification

Real-world datasets often present different degrees of imbalanced (i.e.,...
research
10/30/2018

Weak-supervision for Deep Representation Learning under Class Imbalance

Class imbalance is a pervasive issue among classification models includi...
research
07/23/2020

SeismoGlow – Data augmentation for the class imbalance problem

In several application areas, such as medical diagnosis, spam filtering,...
research
07/05/2017

Development & Implementation of the Trigger for a Short-baseline Reactor Antineutrino Experiment (SoLid)

SoLid, located at SCK-CEN in Mol, Belgium, is a reactor antineutrino exp...
research
12/04/2018

Bad practices in evaluation methodology relevant to class-imbalanced problems

For research to go in the right direction, it is essential to be able to...
research
09/01/2019

An Efficient Convolutional Neural Network for Coronary Heart Disease Prediction

This study proposes an efficient neural network with convolutional layer...

Please sign up or login with your details

Forgot password? Click here to reset