Prospects for Deep-Learning-Based Mass Reconstruction of Ultra-High-Energy Cosmic Rays using Simulated Air-Shower Profiles
Abstract
Knowledge of the mass composition of ultra-high-energy cosmic rays is crucial to understanding their origins; however, current approaches have limited event-by-event resolution. With fluorescence telescope measurements of the longitudinal shower profile, there are opportunities to improve this situation by applying Machine Learning (ML) to leverage more information beyond Xmax alone. To our knowledge, we present the first study of a deep-learning neural-network approach to predict a primary's mass (A) directly from the longitudinal energy-deposit profile of simulated extensive air showers. We train and validate our model on simulated showers, generated with CONEX and EPOS-LHC, covering nuclei from A = 1 to 61, sampled uniformly in A. After rescaling, our network achieves a maximum bias better than 0.4 in A with a resolution between 1.5 for protons and 1 for iron, corresponding to a proton-iron Merit Factor of 2.19 (AUC = 0.976). We benchmark this against simpler ML models trained on profile-shape parameters (X max, Ecal, R, and L) extracted from the same data. We find that even simple models can substantially exceed published benchmarks for combinations of these observables, demonstrating that ML methods applied even to standard profile-shape parameters can significantly improve available mass sensitivity. The CNN outperforms this strong baseline, and this performance is only mildly degraded when cross-predicting on simulations made with the Sibyll-2.3d hadronic interaction model, showing robustness against model choice. The network also maintains its performance across a wide range of noise conditions. An ablation study further demonstrates that the full profile contains composition-sensitive structure not captured by the GH parameterization, while the strong performance of the CNN suggests this information should be resolvable in real events.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.