Empirical Bayes large-scale multiple testing for high-dimensional binary outcome data

Abstract

This paper explores the multiple testing problem for sparse high-dimensional data with binary outcomes. We propose novel empirical Bayes multiple testing procedures based on a spike-and-slab posterior and then evaluate their performance in controlling the false discovery rate (FDR). A surprising finding is that the procedure using the default conjugate prior (namely, the -value procedure) can be overly conservative in estimating the FDR. To address this, we introduce two new procedures that provide accurate FDR control. Sharp frequentist theoretical results are established for these procedures, and numerical experiments are conducted to validate our theory in finite samples. To the best of our knowledge, we obtain the first uniform FDR control result in multiple testing for high-dimensional data with binary outcomes under the sparsity assumption.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…