The BarYon CYCLE Project (ByCycle): Identifying and Localizing MgII Metal Absorbers with Machine Learning
Abstract
The upcoming ByCycle project on the VISTA/4MOST multi-object spectrograph will offer new prospects of using a massive sample of 1 million high spectral resolution (R = 20,000) background quasars to map the circumgalactic metal content of foreground galaxies (observed at R = 4000 - 7000), as traced by metal absorption. Such large surveys require specialized analysis methodologies. In the absence of early data, we instead produce synthetic 4MOST high-resolution fibre quasar spectra. To do so, we use the TNG50 cosmological magnetohydrodynamical simulation, combining photo-ionization post-processing and ray tracing, to capture MgII (λ2796, λ2803) absorbers. We then use this sample to train a Convolutional Neural Network (CNN) which searches for, and estimates the redshift of, MgII absorbers within these spectra. For a test sample of quasar spectra with uniformly distributed properties (λMgII,2796, EWMgII,2796rest = 0.05 - 5.15 , SNR = 3 - 50), the algorithm has a robust classification accuracy of 98.6 per cent and a mean wavelength accuracy of 6.9 . For high signal-to-noise spectra (SNR > 20), the algorithm robustly detects and localizes MgII absorbers down to equivalent widths of EWMgII,2796rest = 0.05 . For the lowest SNR spectra (SNR=3), the CNN reliably recovers and localizes EWMgII,2796rest ≥ 0.75 \, absorbers. This is more than sufficient for subsequent Voigt profile fitting to characterize the detected MgII absorbers. We make the code publicly available through GitHub. Our work provides a proof-of-concept for future analyses of quasar spectra datasets numbering in the millions, soon to be delivered by the next generation of surveys.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.