Nash: Neural Adaptive Shrinkage for Structured High-Dimensional Regression
Abstract
Sparse linear regression is a fundamental tool in data analysis. However, traditional approaches often fall short when covariates exhibit structure or arise from heterogeneous sources. In biomedical applications, covariates may stem from distinct modalities or be structured according to an underlying graph. We introduce Neural Adaptive Shrinkage (Nash), a unified framework that integrates covariate-specific side information into sparse regression via neural networks. Nash adaptively modulates penalties on a per-covariate basis, learning to tailor regularization without cross-validation. We use a split variational empirical Bayes algorithm that decouples prior learning from posterior inference, reducing the M-step from O(p) neural-network passes per sweep to a single batched pass, a 74 to 106x wall-clock speedup over previously proposed coordinate ascent CAVI for p between 102 and 104. Experiments on real data demonstrate that Nash improves accuracy and adaptability over existing methods.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.