Harvesting the Lyα\ forest with convolutional neural networks

Abstract

We develop a machine learning based algorithm using a convolutional neural network (CNN) to identify low HI column density Lyα absorption systems (NHI/ cm-2<17) in the Lyα forest, and predict their physical properties, such as their HI column density (NHI/ cm-2), redshift (zHI), and Doppler width (bHI). Our CNN models are trained using simulated spectra (S/N 10), and we test their performance on high quality spectra of quasars at redshift z2.5-2.9 observed with the High Resolution Echelle Spectrometer on the Keck I telescope. We find that 78\% of the systems identified by our algorithm are listed in the manual Voigt profile fitting catalogue. We demonstrate that the performance of our CNN is stable and consistent for all simulated and observed spectra with S/N 10. Our model can therefore be consistently used to analyse the enormous number of both low and high S/N data available with current and future facilities. Our CNN provides state-of-the-art predictions within the range 12.5≤NHI/cm-2<15.5 with a mean absolute error of (NHI/ cm-2)=0.13, (zHI)=2.7×10-5, and (bHI)=4.1\ km\ s-1. The CNN prediction costs <3 minutes per model per spectrum with a size of 120\,000 pixels using a laptop computer. We demonstrate that CNNs can significantly increase the efficiency of analysing Lyα forest spectra, and thereby greatly increase the statistics of Lyα absorbers.

0

Discussion (0)

Sign in to join the discussion.

Loading comments…