Coffea – Columnar Object Framework For Effective Analysis

08/28/2020
by   Nicholas Smith, et al.
0

The coffea framework provides a new approach to High-Energy Physics analysis, via columnar operations, that improves time-to-insight, scalability, portability, and reproducibility of analysis. It is implemented with the Python programming language, the scientific python package ecosystem, and commodity big data technologies. To achieve this suite of improvements across many use cases, coffea takes a factorized approach, separating the analysis implementation and data delivery scheme. All analysis operations are implemented using the NumPy or awkward-array packages which are wrapped to yield user code whose purpose is quickly intuited. Various data delivery schemes are wrapped into a common front-end which accepts user inputs and code, and returns user defined outputs. We will discuss our experience in implementing analysis of CMS data using the coffea framework along with a discussion of the user experience and future directions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/08/2022

An array-oriented Python interface for FastJet

Analysis on HEP data is an iterative process in which the results of one...
research
11/05/2020

ARGG-HDL: A High Level Python Based Object-Oriented HDL Framework

We present a High-Level Python-based Hardware Description Language (ARGG...
research
03/22/2021

hep_tables: Heterogeneous Array Programming for HEP

Array operations are one of the most concise ways of expressing common f...
research
01/22/2019

Using Big Data Technologies for HEP Analysis

The HEP community is approaching an era were the excellent performances ...
research
08/31/2023

Last Mile Delivery with Drones and Sharing Economy

We consider a combined system of regular delivery trucks and crowdsource...
research
09/23/2019

Machine Learning Pipelines with Modern Big Data Tools for High Energy Physics

The effective utilization at scale of complex machine learning (ML) tech...
research
09/23/2019

Machine Learning Pipelines with Modern Big DataTools for High Energy Physics

The effective utilization at scale of complex machine learning (ML) tech...

Please sign up or login with your details

Forgot password? Click here to reset