Design Principles for AI-Ready QCD Data with a Barrel Imaging Calorimeter Application

Abstract

Data from large physics collider experiments in Quantum Chromodynamics (QCD) research differ fundamentally from the modalities used in modern foundation models. The heterogeneity of detector readouts and their technology dependence require principled curation for cross experiment AI applications. We present a design framework for AI-ready QCD data to define a unified data structure that accommodates heterogeneous detector technologies within a single schema. We apply the design principle to the simulated data of the Barrel Imaging Calorimeter (BIC) in the ePIC detector at the Electron--Ion Collider. The BIC simulation data combines AstroPix silicon pixel imaging layers with Pb/ScFi calorimeter layers across different readout types. We describe the schema specialization, data preparation pipeline, and visualization of the curated AI-ready dataset.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…