Propagating data noise through the fit: the Monte Carlo replica distribution
Abstract
The Monte Carlo (MC) replica method quantifies parameter uncertainties in global fits of parton distribution functions (PDFs) and Standard Model Effective Field Theory (SMEFT) Wilson coefficients by fitting a model to many noise-perturbed copies of the data and taking the empirical distribution of the best-fit parameters as the uncertainty. The method reproduces the Bayesian posterior exactly only when the model is linear in its parameters, and departs from it in the nonlinear case. We derive the leading-order distribution the method produces and compare it with the Laplace approximation of the Bayesian posterior: the two differ by a single computable matrix, the residual-weighted Hessian of the model at the best fit, whose sign and magnitude set the over- or under-estimation of the parameter uncertainties. This closed-form expression quantifies when and by how much the MC method departs from Bayesian inference. We illustrate it on two single-parameter examples solvable in closed form and point to its evaluation in full PDF and SMEFT fits as a natural next step.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.