Posterior Asymptotic Normality for an Individual Coordinate in High-dimensional Linear Regression
Abstract
We consider the sparse high-dimensional linear regression model Y=Xb+ε where b is a sparse vector. For the Bayesian approach to this problem, many authors have considered the behavior of the posterior distribution when, in truth, Y=Xβ+ε for some given β. There have been numerous results about the rate at which the posterior distribution concentrates around β, but few results about the shape of that posterior distribution. We propose a prior distribution for b such that the marginal posterior distribution of an individual coordinate bi is asymptotically normal centered around an asymptotically efficient estimator, under the truth. Such a result gives Bayesian credible intervals that match with the confidence intervals obtained from an asymptotically efficient estimator for bi. We also discuss ways of obtaining such asymptotically efficient estimators on individual coordinates. We compare the two-step procedure proposed by Zhang and Zhang (2014) and a one-step modified penalization method.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.