Speeding HEP Analysis with ROOT Bulk I/O

06/11/2019
by   Brian Bockelman, et al.
0

Distinct HEP workflows have distinct I/O needs; while ROOT I/O excels at serializing complex C++ objects common to reconstruction, analysis workflows typically have simpler objects and can sustain higher event rates. To meet these workflows, we have developed a "bulk I/O" interface, allowing multiple events data to be returned per library call. This reduces ROOT-related overheads and increases event rates - orders-of-magnitude improvements are shown in microbenchmarks. Unfortunately, this bulk interface is difficult to use as it requires users to identify when it is applicable and they still "think" in terms of events, not arrays of data. We have integrated the bulk I/O interface into the new RDataFrame analysis framework inside ROOT. As RDataFrame's interface can provide improved type information, the framework itself can determine what data is readable via the bulk IO and automatically switch between interfaces. We demonstrate how this can improve event rates when reading analysis data formats, such as CMS's NanoAOD.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/17/2020

Evolution of the ROOT Tree I/O

The ROOT TTree data format encodes hundreds of petabytes of High Energy ...
research
03/02/2021

Coffea-casa: an analysis facility prototype

Data analysis in HEP has often relied on batch systems and event loops; ...
research
04/16/2020

Continuous Health Interface Event Retrieval

Knowing the state of our health at every moment in time is critical for ...
research
06/20/2023

PyRCA: A Library for Metric-based Root Cause Analysis

We introduce PyRCA, an open-source Python machine learning library of Ro...
research
04/18/2021

Schema Curation via Causal Association Rule Mining

Event schemas are structured knowledge sources defining typical real-wor...
research
06/14/2003

OO Model of the STAR offline production "Event Display" and its implementation based on Qt-ROOT

The paper presents the "Event Display" package for the STAR offline prod...
research
08/09/2023

Xenophobic Events vs. Refugee Population – Using GDELT to Identify Countries with Disproportionate Coverage

In this preliminary study, we used the Global Database of Events, Langua...

Please sign up or login with your details

Forgot password? Click here to reset