Targeted empirical Bayes for more supervised joint factor analysis

Abstract

Joint Bayesian factor models are popular for characterizing relationships between multivariate correlated predictors and a response variable. Standard models assume that all variables, including both the predictors and the response, are conditionally independent given latent factors. In marginalizing out these factors, one obtains a low rank plus diagonal factorization for the joint covariance, which implies a linear regression for the response given the predictors. Although there are many desirable properties of such models, these methods can struggle to identify the signal when the response is not dependent on the dominant principal components in the predictors. To address this problem, we propose estimating the residual variance in the response model with an empirical Bayes procedure that targets predictive performance of the response given the predictors. We illustrate that this can lead to substantial improvements in simulation performance. We are particularly motivated by studies assessing the health effects of environmental exposures and provide an illustrative application to NHANES data.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…