OTCliM: generating a near-surface climatology of optical turbulence strength (Cn2) using gradient boosting
Abstract
This study introduces OTCliM (Optical Turbulence Climatology using Machine learning), a novel approach for deriving comprehensive climatologies of atmospheric optical turbulence strength (Cn2) using gradient boosting machines. OTCliM addresses the challenge of efficiently obtaining reliable site-specific Cn2 climatologies near the surface, crucial for ground-based astronomy and free-space optical communication. Using gradient boosting machines and global reanalysis data, OTCliM extrapolates one year of measured Cn2 into a multi-year time series. We assess OTCliM's performance using Cn2 data from 17 diverse stations in New York State, evaluating temporal extrapolation capabilities and geographical generalization. Our results demonstrate accurate predictions of four held-out years of Cn2 across various sites, including complex urban environments, outperforming traditional analytical models. Non-urban models also show good geographical generalization compared to urban models, which capture non-general site-specific dependencies. A feature importance analysis confirms the physical consistency of the trained models. It also indicates the potential to uncover new insights into the physical processes governing Cn2 from data. OTCliM's ability to derive reliable Cn2 climatologies from just one year of observations can potentially reduce resources required for future site surveys or enable studies for additional sites with the same resources.