Sequential Correct Screening and Post-Screening Inference
Abstract
Selecting the top-m variables with the m largest population parameters from a larger set of candidates is a fundamental problem in statistics. In this paper, we propose a novel methodology called Sequential Correct Screening (SCS), which sequentially screens out variables that are not among the top-m. A key feature of our method is its anytime validity; it provides a sequence of variable subsets that, with high probability, always contain the true top-m variables. Furthermore, we develop a post-screening inference (PSI) procedure to construct confidence intervals for the selected parameters. Importantly, this procedure is designed to control the false coverage rate (FCR) whenever it is conducted -- an aspect that has been largely overlooked in the existing literature. We establish theoretical guarantees for both SCS and PSI, and demonstrate their performance through simulation studies and an application to a real-world dataset on suicide rates.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.