Bias Correction for Semiparametric Regression Models
Abstract
We consider a broad class of semiparametric regression models in which the conditional distribution of the response takes the form f\Y|x Tβ+m(z), φ\, which is known up to a parametric component β of diverging dimension p, a smooth function m(·), and a dispersion parameter φ. Existing semiparametric literature on such models has primarily focused on semiparametric efficiency for β, typically treating φ and m(·) as nuisances and largely ignoring their finite-sample bias. However, the finite-sample bias of standard estimators can be substantial (especially when p is large relatively to n and/or dispersion is high) and can seriously undermine inference for β. Moreover, φ is often of direct scientific interest and requires accurate estimation. To address this gap, we propose SABRE, a simulation-based bias correction framework for this broad semiparametric model class. We establish asymptotic properties of SABRE for the subclass of generalized partially linear models, where bias reduction for β and φ can be achieved without inflating variance, and we outline how the underlying principle may be adapted more generally. Comprehensive simulation studies and a real-data application on early-stage diabetes demonstrate the empirical effectiveness of SABRE in reducing bias and improving inference.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.