AGNBoost: A Machine Learning Approach to AGN Identification with JWST/NIRCam+MIRI Colors and Photometry
Abstract
We present AGNBoost, a machine learning framework utilizing XGBoostLSS to identify AGN and estimate redshifts from JWST NIRCam and MIRI photometry. AGNBoost constructs 66 input features from 7 NIRCam and 4 MIRI bands to predict the fraction of mid-IR 3--30\,μm emission attributable to an AGN power law (fracAGN) and photometric redshift. Each model is trained on 106 simulated galaxies from CIGALE. Models are tested on mock CIGALE galaxies, an independent set of empirically-derived templates, and 748 observations from the JWST MIRI EGS Galaxy and AGN (MEGA) survey. On idealized noise-free mock CIGALE galaxies, AGNBoost achieves 15\% outlier fractions of 1.63\% (fracAGN) and 0.15\% (redshift), with σRMSE = 0.045 for fracAGN and σNMAD = 0.004 for redshift. When realistic photometric uncertainties are introduced, performance remains robust with median predictions on the 1:1 relation, though outlier fractions increase to 4.38\% and 3.35\%, respectively. On the independent template set, AGNBoost identifies 92.6\% of AGN candidates with fracAGN > 0.3 and 100\% with fracAGN > 0.5, demonstrating generalization beyond the training distribution. On MEGA galaxies with spectroscopic redshifts, AGNBoost achieves σNMAD = 0.056 and 19.79\% outliers. AGNBoost fracAGN estimates broadly agree with CIGALE fitting (σRMSE = 0.178, 11.96\% outliers). The flexible framework allows straightforward incorporation of additional photometric bands and re-training for other variables. AGNBoost's computational efficiency makes it well-suited for wide-sky surveys requiring rapid AGN identification and redshift estimation.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.