Scalable Generalised Accuracy Estimation for Multisource Register-based Official Statistics
Abstract
Official statistics are undergoing a significant transformation, as national statistical institutes transition from traditional single-source data production systems to integrated systems of statistical registers combining administrative, census, and survey data. The resulting multisource register-based estimates are prone to multiple interacting sources of error, yet rigorous scalable frameworks for quantifying their accuracy remain underdeveloped. This work discusses and validates a global measure of error assessment for such multisource register-based statistics. Focusing on two central sources of uncertainty, sampling and modelling, we derive an analytical solution that accurately approximates the global error of mass-imputation procedures under a multinomial logistic model. The proposed measure is interpretable, flexible, and computationally scalable, enabling on-the-fly accuracy quantification for user-defined, unplanned domain-specific statistics on population totals. Its validity is established theoretically and confirmed through simulation studies. An application to education data from the Italian National Institute of Statistics is presented.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.