Leveraging universality of jet taggers through transfer learning

03/11/2022
by Frédéric A. Dreyer et al.

A significant challenge in tagging boosted objects with machine learning is the prohibitive computational cost of training sophisticated models. Nevertheless, the universality of QCD suggests that much of the information learnt during training is common to different physical signals and experimental setups. In this article, we explore the use of transfer learning techniques to develop fast and data-efficient jet taggers that leverage such universality. We consider the graph neural networks LundNet and ParticleNet, and introduce two prescriptions to transfer an existing tagger to a new signal, based either on fine-tuning all the weights of a model or on freezing a fraction of them. In the case of W-boson and top-quark tagging, we find that one can obtain reliable taggers using an order of magnitude less data, with a corresponding speed-up of the training process. Moreover, while keeping the size of the training data set fixed, we observe a speed-up of the training by up to a factor of three. This offers a promising avenue to facilitate the use of such tools in collider physics experiments.
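The two transfer prescriptions named in the abstract (fine-tuning all weights, or freezing a fraction of them) can be illustrated with a minimal, framework-free sketch. Everything here is hypothetical and chosen for illustration: the `Layer` class, the layer names, the toy weights and gradients, and in particular the choice to freeze the earliest layers, which the abstract does not specify. It is not the LundNet or ParticleNet code.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Layer:
    """Hypothetical stand-in for one block of a pretrained tagger."""
    name: str
    weights: List[float]
    trainable: bool = True


def fine_tune_all(layers: List[Layer]) -> None:
    """Prescription 1: every weight stays trainable on the new signal."""
    for layer in layers:
        layer.trainable = True


def freeze_fraction(layers: List[Layer], fraction: float) -> int:
    """Prescription 2: freeze the first `fraction` of layers (rounded down).

    Freezing the earliest layers is an assumption of this sketch.
    Returns the number of frozen layers.
    """
    n_frozen = int(len(layers) * fraction)
    for i, layer in enumerate(layers):
        layer.trainable = i >= n_frozen
    return n_frozen


def sgd_step(layers: List[Layer], grads: List[List[float]], lr: float) -> None:
    """One gradient-descent update that skips frozen layers."""
    for layer, grad in zip(layers, grads):
        if layer.trainable:
            layer.weights = [w - lr * g for w, g in zip(layer.weights, grad)]


# Toy "pretrained" model; names and values are illustrative only.
model = [
    Layer("conv_1", [1.0, 2.0]),
    Layer("conv_2", [3.0, 4.0]),
    Layer("conv_3", [5.0, 6.0]),
    Layer("classifier", [7.0]),
]
grads = [[1.0, 1.0], [1.0, 1.0], [1.0, 1.0], [1.0]]

# Freeze half of the layers, then take one update step on the new signal.
n_frozen = freeze_fraction(model, 0.5)
sgd_step(model, grads, lr=0.5)
```

After the step, only `conv_3` and `classifier` have changed. In a real deep-learning framework, frozen weights would typically also be excluded from gradient computation, which is one plausible way freezing could shorten training.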

