Characterizing Stellar Streams with Error-Aware Machine Learning
Abstract
Stellar streams are thin, elongated collections of stars formed by gravitational disruption of orbiting star clusters or dwarf galaxies and are highly sensitive probes of the Milky Way's dark matter distribution and formation history. We present SCREAM (Stream ChaRacterization with Error Aware Machine Learning), a weakly-supervised framework to identify member stars of stellar streams. Building on the CATHODE method originally developed for particle physics, SCREAM identifies streams as localized feature-space over-densities, avoiding rigid physical priors like assumed gravitational potentials or strict isochrone filtering. Crucially, SCREAM is the first machine learning (ML) framework in this domain to directly incorporate observational uncertainties into the neural network training objective. Using astrometric and photometric data from Gaia Data Release 3 and the Dark Energy Spectroscopic Instrument (DESI) Legacy imaging survey, we demonstrate our algorithm's performance on the prominent GD-1 stream. Validated against independent labels, SCREAM achieves an F1 score of 0.745, substantially outperforming existing ML methods in both precision and recall. Furthermore, SCREAM recovers the physically expected diffuse "cocoon" of GD-1 and faint main-sequence members that classical physics-based algorithms (e.g., STREAMFINDER) miss. Our results highlight the transformative potential of uncertainty-aware, weakly-supervised ML to uncover complex galactic structures.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.